A DECOMPOSITION THEORY FOR BINARY LINEAR CODES 



NAVIN KASHYAP 

Abstract. The decomposition theory of matroids initiated by Paul Seymour in the 1980's has had 
an enormous impact on research in matroid theory. This theory, when applied to matrices over the 
binary field, yields a powerful decomposition theory for binary linear codes. In this paper, we give 
an overview of this code decomposition theory, and discuss some of its implications in the context of 
the recently discovered formulation of maximum-likelihood (ML) decoding of a binary linear code 
over a discrete memoryless channel as a linear programming problem. We translate matroid-theoretic 
results of Grotschel and Truemper from the combinatorial optimization literature to give examples of 
non-trivial families of codes for which the ML decoding problem can be solved in time polynomial 
in the length of the code. One such family is that consisting of codes C for which the codeword 
polytope is identical to the Koetter-Vontobel fundamental polytope derived from the entire dual code 
C^. However, we also show that such families of codes are not good in a coding-theoretic sense 
— either their dimension or their minimum distance must grow sub-linearly with codelength. As a 
consequence, we have that decoding by linear programming, when applied to good codes, cannot 
avoid failing occasionally due to the presence of pseudocodewords. 



1. Introduction 

Historically, the theory of error-correcting codes has benefited greatly from the use of techniques 
from algebra and algebraic geometry. Combinatorial and graph-theoretic methods have also proven 
to be useful in the construction and analysis of codes, especially within the last ten years since 
the re-discovery of low-density parity-check codes. However, one area of mathematics that has, 
surprisingly, only played a relatively minor role in the development of coding theory is the field of 
matroid theory. The reason this is surprising is that, as anyone with even a basic understanding of 
the two fields would realize, coding theory and matroid theory are very closely related. The former 
deals with matrices over a finite field F, and these objects are also of fundamental importance in the 
latter, where they go under the name of F-representable matroids. 

The connection with matroid theory has not gone unnoticed among coding theorists. Indeed, as 
far back as 1976, Greene (24] noticed that the Tutte polynomial of a matroid, when specialized to 
a linear code C, was equivalent (in a certain sense) to the homogeneous weight-enumerator poly- 
nomial Wc{x, y) = where Ai is the number of words of weight i in C, and n is the 
length of C. The MacWilliams identities are then a special case of a known identity for the Tutte 
polynomial [9, Chapter 4]. The connection with matroids was also exploited by Barg |3| to derive 
Mac Williams-type identities for generalized-Hamming-weight enumerators of a code. 

However, aside from such use of tools from matroid theory to re-derive results in coding theory 
that had already been proved by other means, each field seems to have had little impact on the 
other. Matroid theory has derived its inspiration largely from graph theory, and its most successful 
applications have traditionally been in the areas of combinatorial optimization and network flows. 
Very recently, matroid theory has found new applications in the rapidly evolving field of network 
coding lilll . 
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On the other hand, coding theory (by which we mean the theory of error-correcting codes, in 
contrast to the theory of network coding) has been largely unconcerned with developments in com- 
binatorial optimization, as the fundamental problems in the former seemed to be of a different 
nature from those in the latter. However, the recent re-formulation of the maximum-likelihood 
(ML) decoding problem, for a binary linear code over a discrete memoryless channel, as a lin- 
ear programming problem [15] has opened a channel through which matroid-theoretic results in 
combinatorial optimization can be applied to coding theory. The key tool in these results is the 
use of the decomposition theory of matroids initiated by Seymour ll42]| . fl3ll . Based on Seymour's 
seminal work, Grotschel and Truemper 1251 showed that the minimization of a linear functional 
over the cycle polytope of a binary matroid could be solved in polynomial time for certain classes 
of matroids. This immediately implies that for the corresponding families of codes, the ML de- 
coding problem can be solved in time polynomial in the length of the code. Given the fact that 
the ML decoding problem is known to be NP-hard in general H, the existence of "non-trivial" 
classes of codes for which ML decoding can be implemented in polynomial time, is obviously a 
significant result. However, as we will show in this paper, for a code family to which the Grotschel- 
Truemper result applies, either the dimension or minimum distance of the codes in the family grows 
sub-linearly with codelength. Thus, such code families are not good from a coding-theoretic per- 
spective. However, they do illustrate the important point that polynomial-time ML decoding is 
possible. Moreover, the matroid-theoretic arguments used by Grotschel and Truemper do not rule 
out the possibility that there may exist other code families for which polynomial-time ML decoding 
algorithms exist, which are also good in terms of rate and minimum distance. 

The primary goal of this paper is to provide an exposition of the ideas needed to understand 
and apply the work of Grotschel and Truemper. As mentioned earlier, their work relies upon the 
machinery provided by Seymour's matroid decomposition theory, and so we will first present that 
theory in a coding-theoretic setting. Our presentation of this decomposition theory will be of a 
tutorial nature. We have attempted to keep the presentation self-contained to the extent possible; 
we do not provide complete proofs of some of the difficult theorems that form the basis of the 
theory. 

We provide the relevant definitions and background from matroid theory in Section |2] of this 
paper. As explained in that section, binary matroids and binary linear codes are essentially the same 
objects. So, techniques applicable to binary matroids are directly applicable to binary linear codes 
as well. In particular, matroid decomposition techniques can be specialized to codes. 

Of central importance in matroid theory is the notion of matroid minors. In the context of codes, 
a minor of a code C is any code that can be obtained from C by a sequence of shortening and 
puncturing operations. Minors have received little (if any) attention in coding theory, and this 
seems to be a remarkable oversight given the fact that they sometimes capture important structural 
properties of a code. For example, the presence or absence of certain minors (as stated precisely 
in Theorem 12.11 ) decides whether or not a code is graphic, i.e., has a parity-check matrix that is 
the vertex-edge incidence matrix of some graph. Graphic codes have been studied previously in 
the information theory literature ||26l,|l31], but the excluded-minor characterization of these codes 
appears to have been overlooked in these studies. 

In Section[3l we introduce a notion of A;-connectedness for codes, which is again a specialization 
of the corresponding notion for matroids. This is closely related to ^-connectedness in graphs, and 
interestingly enough, is also related to the trellis complexity of a code [17] . We do not explore the 
latter relationship in any detail in this paper, instead referring the reader to [|32l . [|33l . where matroid 
methods are used to study the structure of codes with low trellis complexity. 

The notion of ^-connectedness plays an important role in Seymour's decomposition theory. An 
idea of why this is so can be obtained from the simple case of 2-connectedness: it follows from 
the relevant definitions that a code is not 2-connected if and only if it is the direct sum of smaller 
codes. Similar statements can be made for codes that are not 3- or 4-connected (or more precisely. 
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not internally 4-connected — see Definition l4.4l) via the code-composition operations of 2-sum and 
3-sum introduced by Seymour 1421 . These operations, as well as the 3-sum which is in a sense the 
dual operation to the 3-sum, are explained in detail in Section |4l 

The operations of 2-, 3- and 3-sum have the non-trivial property that when two codes Ci and C2 
are composed using one of these sums to form a code C, then Ci and C2 are (up to code equivalence) 
minors of C. The relationship between fc-connectedness and these sums can then be summarized 
as follows: a binary linear code is 2-connected but not 3-connected (resp. 3-connected, but not 
internally 4-connected) iff it can be expressed as the 2-sum (resp. 3- or 3-sum) of codes Ci and 
C2, both of which are equivalent to minors of C. It follows immediately from the above facts 
that any binary linear code is either 3-connected and internally 4-connected, or can be constructed 
from 3-connected, internally 4-connected minors of it by a sequence of operations of coordinate 
permutation, direct sum, 2-sum, 3-sum and 3-sum. In fact, given any code, such a decomposition 
of the code can be obtained in time polynomial in the length of the code. 

This code decomposition theory has immediate applications to families of codes that are minor- 
closed in the sense that for each code C in such a family (t, any code equivalent to a minor of C 
is also in (t. Indeed, a code C is in a minor-closed family only if the indecomposable {i.e., 3- 
connected, internally 4-connected) pieces obtained in the aforementioned decomposition of C are 
in £. The above necessary condition is also sufficient if the code family is additionally closed 
under the operations of direct sum, 2-sum, 3-sum and 3-sum. Thus, membership of an arbitrary 
code in such a family £ can be decided in polynomial time iff the membership in of 3-connected, 
internally 4-connected codes can be decided in polynomial time. A formal statement of these and 
other related facts can be found in Section [5] of our paper. 

As an illustrative example, we also outline in Section [5] one of the major applications of Sey- 
mour's decomposition theory. This concerns the family of regular codes which are codes that do 
not contain as a minor any code equivalent to the [7, 4] Hamming code or its dual. Regular codes 
are also characterized by the property that given any parity-check matrix H of such a code, the 1 's 
in H can be replaced by ±l's in such a way that the resulting 0/ ± 1 matrix is totally unimodular. 
A totally unimodular matrix is a real matrix all of whose square submatrices have determinants in 
{0, 1, — 1}. These matrices are of fundamental importance in integer linear programming problems 
ll28l . Seymour [42J proved that a binary linear code is regular iff it can be decomposed into codes 
that are either graphic, or duals of graphic codes, or equivalent to a special [10, 5, 4] code he called 

RlQ. 

The application of the decomposition theory to linear programming, and in particular to ML 
decoding, is the subject of Section [6l Feldman et al. showed that the ML decoding problem for a 
length-n binary linear code C over a discrete memoryless channel can be formulated as a minimiza- 
tion problem min 7jCj, where 7 = (71, ... , 7„) € M" is a certain cost vector derived from the 
received word and the channel transition probabilites, and the minimum is taken over all codewords 
c = (ci, . . . , c„) in C. Now, if C is a graphic code, then standard graph-theoretic techniques from 
combinatorial optimization can be used to find the minimizing codeword in time polynomial in n; a 
sketch of such an algorithm can be found in Appendix iBl Grotschel and Truemper [25] additionally 
showed that this minimization could also be performed in polynomial time for certain minor-closed 
code families that are "almost-graphic" in a certain sense. Such a code family (J is characterized 
by the property that there exists a finite list of codes 2) such that each C € can be decomposed 
in polynomial time in such a way that at each step of the decomposition, one of the pieces is either 
graphic or in 2). 

Grotschel and Truemper gave a polynomial-time algorithm that takes as input a length-n code 
C from an almost-graphic family and a cost vector 7 G M", and constructs a codeword c € C 
achieving min ^ ^ 7jCi by solving related minimization problems over the pieces of the decomposi- 
tion that are graphic or in 2). This algorithm is also outlined in Appendix|B] Thus, the ML decoding 
problem can be solved in polynomial time for almost-graphic codes. 
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Grotschel and Truemper also gave several examples of almost-graphic families. Interestingly 
enough, one of these families is that consisting of codes C for which the codeword poly tope (i.e., 
the convex hull in M" of the codewords in the length-n code C) is identical to the Koetter-Vontobel 
fundamental polytope [55] derived from the entire dual code C"*-. 

Unfortunately, the one truly original result in this paper is a negative result. We show that for 
codes in an almost-graphic family, either their dimension or their minimum distance grows sub- 
linearly with codelength. One important implication of this is that decoding by linear program- 
ming, when applied to any good error-correcting code, must inevitably hit upon the occasional 
pseudocodeword, thus resulting in decoding failure. 

We make some concluding remarks in Section |7] Some of the lengthier or more technical proofs 
of results from Sections |4] and [6] are given in appendices to preserve the flow of the presentation. 

2. Matroids and Codes 

We shall assume familiarity with coding theory; for relevant definitions, see |[36l . We will mainly 
concern ourselves with binary linear codes, and use standard coding-theoretic notation throughout 
this paper. Thus, an [n, k] code is a code of length n and dimension k, and an [n, k, d] code is an 
[n, k] code that has minimum distance d. Given a code C, dim(C) denotes the dimension of C, and 
C-*- denotes the dual code of C. 

The main purpose of this section is to introduce concepts from matroid theory that are applicable 
to coding theory. We will largely follow the definitions and notation of Oxley 138.1 . We begin with 
a definition of matroids. 

Definition 2.1. A matroid M is an ordered pair (E, I) consisting of a finite set E and a collection 
X of subsets of E satisfying the following three conditions: 

(i) G T; 

(ii) if I € I and J C I, then J G I; and 

(iii) ifli,l2 are in I and \Ii\ < I/2I. then there exist^e I2 — Ii such that Ii U {e} G I. 

The set E above is called the ground set of the matroid M, and the members of 1 are the 
independent sets of M. A maximal independent set, i.e., a set B G X such that B U {e} ^ 2 for 
any e ^ E — B,is called a basis of M. It is a simple consequence of (iii) in Definition 12. II that all 
bases of M have the same cardinality. The cardinality of any basis of M is defined to be the rank 
of M, denoted by r(M). 

A subset of E that is not in T is called a dependent set. Minimal dependent sets, i.e., dependent 
sets all of whose proper subsets are in I, are called circuits. It easily follows from the definitions 
that a subset of is a dependent set if and only if it contains a circuit. A dependent set that can be 
expressed as a disjoint union of circuits is called a cycle. 

The above definitions of independent and dependent sets, bases and rank simply try to abstract 
the notion of independence and dependence, bases and dimension, respectively, in a vector space 
over a field. Indeed, the most important class of matroids for our purposes is the class of binary 
matroids, which are simply vector spaces over the binary field, or to put it another way, binary linear 
codes. 

Let H be. a. binary m x n matrix, and let vi , V2 , . . . , v„ denote the column vectors of H. Set 
E = {1,2, ... ,n} and take 2 to be the collection of subsets I = {ii,i2, ■ ■ ■ ,is} C E such that 
the sequence of vectors Vj^ , , • • • , Vj^ is linearly independent over the binary field F2. It follows 
from elementary linear algebra that {E, 2) satisfies the definition of a matroid given above, and thus 
defines a matroid which we shall denote by M[H]. Note that r{M[H]) equals the rank (over F2) of 
the matrix H. 



In this paper, we will use A — Bio denote the set difference A n B'^ . The more usual notation A\B has been reserved 
for the matroid operation of "deletion". 
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A matroid M = {E,l) is called binary if it is isomorphic to M[H] for some binary matrix 
H. Here, we say that two matroids M = {E,l) and M' = {E',1') are isomorphic, denoted by 
M = M', if there is a bijection ip : E E' such that for all J C -E, it is the case that J £ I if and 
only i^^p{J) 

A binary matrix H is also the parity-check matrix of some binary linear code C. Note that 
r{M[H]) = n — dim(C). The code C and the binary matroid M[H] are very closely related. Recall 
from coding theory that a codeword c = (ciC2 . . . c„) E C, c 7^ 0, is called minimal if its support 
supp(c) = {i : Ci = 1} does not contain as a subset the support of any other nonzero codeword 
in C. It is easily seen that c is a minimal codeword of C iff its support is a circuit of M[H]. It 
follows from this that for any c G {0, 1}", c 7^ 0, we have c G C iff supp(c) is a cycle of M[H]. 
Furthermore, a routine verification shows that for binary matrices H and H', M[H] = M[H'] iff 
H and H' are parity-check matrices of the same code C. This allows us to associate a unique binary 
matroid with each binary linear code C, and vice versa. 

Thus, binary matroids and binary linear codes are essentially the same objects. In particular, two 
codes are equivalent if and only if their associated binary matroids are isomorphic. This association 
between codes and binary matroids allows us to use tools from matioid theory to study binary linear 
codes. 

While many of the tools used to study matroids have their roots in linear algebra, there is another 
source that matroid theory draws from, namely, graph theory. Indeed, Whitney's founding paper on 
matroid theory |[56]| was an attempt to capture the fundamental properties of independence that are 
common to graphs and matrices. 

Let Q be a. finite undirected graph (henceforth simply "graph") with edge set E. Define cyc to 
be the collection of edge sets of cycles (i.e., closed walks) in Q. Define I C to be independent if 
/ does not contain any member of cyc as a subset. Equivalently, / is independent if the subgraph 
of G induced by / is a forest. Setting T to be the collection of independent subsets of E, it turns 
out that {E,l) is a matroid [.38. Proposition 1.1.7]. This matroid is called the cycle matroid of Q, 
and is denoted by M (Q). A matroid that is isomorphic to the cycle matroid of some graph is called 
graphic. 

Clearly, the circuits of M{Q) are the edge sets of simple cycles (i.e., closed walks in which no 
intermediate vertex is visited twice) in Q. The nomenclature "cycle" for the disjoint union of circuits 
in a matroid actually stems from its use in the context of graphs. The bases of M(^) are the unions 
of edge sets of spanning trees of the connected components of Q. Hence, r{M{Q)) = \V{G) \ — t, 
where V{Q) is the set of vertices of Q, and t is the number of connected components of Q. 

It is not hard to show that a graphic matroid is binary. Indeed, let A be the vertex-edge incidence 
matrix of G- This is the matrix [aij] whose rows and columns are indexed by the vertices and 
edges, respectively, of Q, where aij = 1 if the jth edge is incident with the ith vertex, and aij = 
otherwise. It may be verified that M{G) = M[A] (see, e.g., |38, Proposition 5.1.2]). 

Given a graph G, we will denote by C{G) the code associated (or identified) with the binary 
matroid M{G)- In other words, C{G) is the binary linear code that has the vertex-edge incidence 
matrix of ^ as a parity-check matrix. We will refer to such codes as graphic codes, and denote 
by r the set of all graphic codes. Graphic codes have made their appearance previously in the 
information theory literature 1111261 (also see 131] and the references therein). 

The repetition code of length n is a graphic code; it is the code obtained from the n-cycle C„, 
the graph consisting of a single cycle on n vertices. However, not all binary codes are graphic. For 
example, it can be shown that the [7,4] Hamming code is not graphic. It is possible to give a precise 
characterization of the codes that are graphic in terms of excluded minors, a notion we need to first 
define. 



In coding theory, two binary linear codes are defined to be equivalent if one can be obtained from the other by a 
permutation of coordinates. In this paper, we will use the notation 7r(C) to denote the code obtained by applying the 
coordinate permutation tt to the code C. 
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There are two well-known ways of obtaining codes of shorter length from a given parent code. 
One is via the operation of puncturing, in which one or more columns are deleted from a generator 
matrix of the parent code [36 , p. 28]. The second method is called shortening, and involves one or 
more columns being deleted from a parity-check matrix of the parent code |i36 , p. 29]. Given a code 
C of length n with generator matrix G, and a subset J C {1,2,..., n}, we will denote by C/ J the 
code obtained by puncturing the columns of G with indices in J, and by C \ J the code obtained by 
shortening at the columns of G with indices in J. Note that C/ J is simply the restriction of the code 

C onto the coordinates not in J, and C\J = {C^ / J)'^ . The notation, though potentially confusing, 
has been retained from matroid theory, where the analogues of puncturing and shortening are called 
contraction and deletion, respectively. 

Definition 2.2. A minor of a code C is any code obtained from C via a (possibly empty) sequence 
of shortening and puncturing operations. 

It may easily be verified that the precise order in which the deletion and puncturing operations 
are performed is irrelevant. Hence, any minor of C may be unambiguously specified using notation 
of the form C/X\Y (or equivalently, C \ Y/X) for disjoint subsets X,Y C {1, 2, ... , n}; this 
notation indicates that C has been punctured at the coordinates indexed by X and shortened at the 
coordinates indexed by Y . 

The above definition allows a code to be a minor of itself. A minor of C that is not C itself is 
called a proper minor of C. Minors have not received much attention in classical coding theory, but 
they play a central role in matroid theory. We will not touch upon the subject of minors of general 
matroids, leaving the reader to refer to |[38l Chapter 3] instead. However, we will briefly mention 
how the matroid operations of deletion and contraction specialize to the cycle matroids of graphs. 

Let Q be some graph, with edge set E. Given e ^ E, define the graph \ e to be the graph 
obtained by deleting the edge e along with any vertices that get isolated as a result of deleting e. 
Also, define ^/e to be the graph obtained by contracting e, i.e., deleting e and identifying the two 
vertices incident with e. The process of obtaining Q /e from Q is called edge contraction, and that of 
obtaining Q\e from Q is of course called edge deletion. These operations are inductively extended 
to define Q\J and ^/ J for any J C E. A minor of a graph Q is any graph obtained from Q via a 
(possibly empty) sequence of edge deletions and contractions. 

The operations of edge deletion and contraction are the graphic analogues of code shortening and 
puncturing, respectively. A mathematically precise statement of this is as follows: given a graph Q 
with edge set E, and any J C E,we have |[38l Equation 3.1.2 and Proposition 3.2.1] 



It follows that any minor of a graphic code is graphic. 

Returning to the question of determining which codes are graphic, the answer can be succinctly 
given in terms of a list of forbidden minors by the following result of Tutte |[50l . 

Theorem 2.1 (15011). A code is graphic if and only if it does not contain as a minor any code 
equivalent to the [7,4] Hamming code or its dual, or one of the codes C{K^)-^ and C{K^^-^)-^. 

In the statement of the above theorem, is the complete graph on five vertices, while K3 3 is 
the complete bipartite graph with three vertices on each side. C{K^)^ is the [10,4,4] code with 
generator matrix 



c{g)/j = c{G/j) and e{G)\j = c{g\j). 
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while C(K3^3) is the [9, 5, 3] code with generator matrix 

" 1 1 1 " 
1 1 1 
001000011 . 
1 1 1 
1 1 1 1 1 

A proof of the theorem can be found in f38', Section 13.3]. On a related note, several authors 
have given algorithms for deciding whether or not a given code is graphic lU |6l |211 [5T1, P8l 
Section 10.6]. These algorithms run in time polynomial in the size of the input, which can be a 
parity-check matrix for the code. We will use this fact later in the paper. 

3. Connectedness 

As mentioned previously, matroid theory draws upon ideas from graph theory. A key concept in 
graph theory is the notion of A;-connectedness for graphs |7, Section III.2]. Given a graph Q, we 
will let V{Q) denote the set of its vertices. A graph is connected if any pair of its vertices can be 
joined by a path; otherwise, it is disconnected. A maximal connected subgraph of is a connected 
component, or simply component, of Q. Let Q — W denote the graph obtained from Q by deleting 
the vertices in W and all incident edges. If Q is connected and, for some subset W C V{Q), Q — W 
is disconnected, then we say that is a vertex cut of Q, or that W separates Q. If is a connected 
graph that has at least one pair of distinct non-adjacent vertices, the connectivity k{Q) of G is defined 
to be the smallest integer j for which Q has a vertex cut W with \ W\ = j. If ^ is connected, but 
has no pair of distinct non-adjacent vertices, k{Q) is defined to be — 1. Finally, if Q is 

disconnected, then we set k{G) = 0. For an integer k > 0,Q is said to be k-connected if k{Q) > k. 
Thus, a graph Q with > 2 is connected if and only if it is 1-connected. 

The notion of A;-connectedness of graphs can be extended to matroids, but it has to be done 
carefully. One of the problems encountered when attempting to do so is that 1 -connectedness of 
graphs does not extend directly to matroids. The reason for this is that for any disconnected graph 
Qi, there is a connected graph C/2 such that M{Qi) = M(^2) lEH Proposition 1.2.8]. So, the link 
between fc-connectedness in graphs and that defined below for matroids begins with the case k = 2. 

The definition we present of ^^-connectedness for matroids was formulated by Tutte f5T\. We 
will once again restrict our attention to the case of binary matroids (i.e., codes) only. Let C be a 
binary linear code of length n. We will hereafter use [n] to denote the set of integers {1,2,... ,n}, 
and for J C [n], we set J'^ = {i £ [n] : i ^ J}. To further alleviate notational confusion, for 
J C [n], we will define C| j to be the restriction of C onto its coordinates indexed by J. Equivalently, 
C\j = C/ J'^, the latter being the code obtained from C by puncturing the coordinates not in J. 

Definition 3.1. For a positive integer k, a partition ( J, J'^) of [n] is called a fc-separation of C if 

min{|J|,|J^|} > (1) 

and 

dim(C| j) + dim(C| jc) - dim(C) < k - 1. (2) 
IfC has a k-separation, then C is said to be A;-separated. 

When equality occurs in ([T]), ( J, J'^) is called a minimal k-separation. When equality occurs in 
( J, J^) is called an exact k-separation. Note that the expression on the left-hand side of (O 
is always non-negative, since dim(C| j) + dim(C| jc) > dim(C) for any J C [n]. This fact easily 
yields the following result. 

Lemma 3.1. C is 1-separated iff it is the direct sum of non-empty codes. 
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Proof. Since dim(C|j) + dim(C|jc) > dim(C) for any J C [n], we see from Definition 13. II that 
(J, J"^) is a 1-separation of C iff J, J'^ are non-empty, and dim(C| j) +dim(C| jc) = dim(C). Hence, 
(J, J'^) is a 1-separation of C iff J, are non-empty, and C is the direct sum of C\j and C\jc. □ 

We now give the definition of /c-connectedness of codes. Note that this definition starts with 
k = 2. 

Definition 3.2. For k > 2, a code C is defined to be fc-connected if it has no k' -separation for any 

k' < k. 

Example 3.1. Let C be the [7,3,4] simplex code with generator matrix 

" 1 1 1 1 
G= 10 10 11 
1110 1 

Setting J = {1,2,3,7}, we see that (J, J^) forms a 3-separation of C. Indeed, the rank of the 
submatrix of G formed by the columns indexed by J is 3, while the rank of the submatrix formed by 
the columns indexed by J'^ is 2. In other words, dim(C| j) = 3 and dim(C| jc) = 2. Thus, both ([7]) 
and (l2l) are satisfied with equality, which makes ( J, J*^) a minimal as well as an exact separation. 

It may be verified (for example, by exhaustive search) that there are no 1- or 2- separations for 
C, and all 3-separations are minimal and exact. In particular, C is 2- and 3-connected, but not 
4-connected. 

The quantity, dim(C| j) + dim(C| jc) — dim(C), appearing on the left-hand side of (O in Defini- 
tion [STT] also arises as part of the definition of the state-complexity profile of a minimal trellis repre- 
sentation of a code [ 161- ll20]| . |[29il .ll341. To be precise, given a length-n code C, the state-complexity 
profile of a code l.29> Equation (1)] is defined to be the vector s(C) = (so(C), . . . , s„(C)), where 
Si{C) = dim(C| j) + dim(C| jc) - dim(C) for J = [i] c [n]. Here, [0] is defined to be the null set 
0. It is known fTTl that s(C) = s(C^), or equivalently, for any J C [n], 

dim(C|j) + dim(C| jc) - dim(C) = dim(C^| j) + dim(C^| jc) - dim(C^). 

As a result, we obtain the interesting and useful fact, stated in the proposition below, that k- 
connectedness is a property that is invariant under the operation of taking code duals. 

Proposition 3.2. Let C be a binary linear code of length n. For any k > 1, a partition (J, J^) of 
[n] is a k-separation of C iff it is a k-separation of C^. Therefore, for any k > 2, C is k-connected 
ijf is k-connected. 

Consider again the simplex code of length 7 from Example 13.11 By Proposition 13.21 above, its 
dual — the [7, 4] Hamming code — is also 2- and 3-connected, but not 4-connected. 

The link between graph and code /^-connectedness is strong, but they are not equivalent notions. 
The closest relation between the two occurs when k = 2. If ^ is a loopless graph without iso- 
lated vertices and > 3, then C{G) is 2-connected iff Q is a 2-connected graph [38, Proposi- 
tion 4. 1.8]. To describe the relation between graph and code connectedness in general, we define the 
connectivity, A(C), of a code C to be the least positive integer k for which there is a /c-separation of 
C, if some /c-separation exists for C; A(C) is defined to be oo otherwise. Note that C is A;-connected 
iff A(C) > k, and by Proposition [ill A(C) = A(C^). It can be shown (381 Corollary 8.2.7] that for 
a connected graph Q / having at least three vertices, 

X{C{g))=mm{K{g),g{g)}, 

where g{Q) denotes the girth (length of shortest cycle) of Q. 

Now, our reason for presenting a notion of connectedness for codes is not just that it extends 
an idea from graph theory. Certain methods of code composition have been developed in matroid 
theory that relate to 2- and 3-separations. These code composition methods can be considered to 
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be generalizations of direct sums, and they allow the result of Lemma |3?T] to be extended in a non- 
trivial manner, paving the way for the powerful decomposition theory of binary matroids initiated by 
Paul Seymour [42]. This decomposition theory allows one to decompose a binary linear code into 
smaller codes in a reversible manner, in such a way that the smaller codes are equivalent to minors 
of the original code. As we shall describe in detail in the next section, to find such a decomposition 
of a code, we need to find 1-, 2- or 3-separations in the code, if such separations exist. 

For any fixed positive integer k, there are polynomial-time algorithms known (see {W. Section 
8.4]) that, given a binary linear code C, either find a /^-separation of C, or conclude that no such 
separation exists. Here, by "polynomial-time algorithm," we mean an algorithm that runs in time 
polynomial in the length of C. For instance, the problem of deciding the existence of 1 -separations 
in a code C is almost trivial. To do so, one takes a matrix A that is either a generator matrix or 
a parity-check matrix of C, brings A to reduced row-echelon form (rref), removes all-zero rows if 
they exist, and finally constructs a certain bipartite graph BG{A). For an m x n matrix A, the 
graph BG{A) is defined as follow^]: the vertex set of BG{A) consists of a set of n left vertices 
{/i, . . . , In} and a set of m right vertices {ri, . . . , r^}', an edge connects the vertices Ij and rj 
iff the («, j)th entry of A is 1. The code C is 2-connected {i.e., has no 1-separation) iff BG{A) 
is connected BSl Lemma 3.3.19]. If C is not 2-connected, the connected components of BG{A) 
induce the required 1-separation. 

In general, for fixed integers k, I with / > k, the problem of finding a ^-separation (J, J'^) of a 
code, with min{| J|, | J^|} > ^, if it exists, can be solved in time polynomial in the length of the 
code, by an algorithm due to Cunningham and Edmonds (in [10]). We sketch the idea here. The 
algorithm is based on the fact that the following problem can be solved in time polynomial in n: for 
codes Ci and C2 each of length n, find a partition of [n] that achieves 

min{dim(Ci|jJ + dim(C2|j2) : (Ji, J2) is apartition of [n]}. (3) 

The above problem is solved using the matroid intersection algorithm BSl Section 5.3], which 
we do not describe here. 

The /^-separation problem of interest to us is equivalent to the following problem for a fixed 
integer I > k: given a code C of length n, find a partition of [n] that achieves 

min{dim(C| jj + dim(C|j2) : (Ji, J2) is apartition of [n], min{| Ji|, | J2I} > I}. (4) 

Indeed, a /c-separation (| J|, | J^|) of C, with min{| J|, \ J'^\} > exists iff the minimum in (jU) is 
at most dim(C) + k — 1. Now, the minimization in (HJl can be solved by finding, for each pair of 
disjoint /-element subsets -Ei, £'2 C [n], the partition ( Ji, J2) of [n] that achieves 

min{dim(C| Jl) + dim(C|j2) : (Ji, J2) is apartition of [n], Ji D Ei, J2 D £2}. (5) 

If (H) can be solved in time polynomial in n for each pair of disjoint /-element subsets Ei, E2 C [n], 
then ^ can also be solved in time polynomial in n, since there are 0{v?^) pairs {Ei, E2). 

It turns out that ^ can be solved in time polynomial in n by converting it to a minimization of 
the form in (O, which, as mentioned above, can be solved in polynomial time. The trick is to set 
Ci = C/E2\Ei and C2 = C j E\\E2, which are both codes of length n — |E'i U i?2|- For notational 
convenience, we will let the coordinates of C\ and C2 retain their indices from C, i.e., the set of 
coordinate indices for C\, as well as for C2, is [n] — (£'1 U £'2)- It rnay easily be verified that for 
J C [n] — (£1 □£'2), dim(Cj|j) = dim(C| jueJ — dim(C|£;J, z = 1,2. Therefore, for any partition 
( Jl, J2) of [n] — (£1 U £2), setting Jj = Jj U £j, i = 1, 2, we have 

dim(Ci|jJ + dim(C2|j2) = dim(C|jJ + ^m\{C\-j^ - dim(C|£;i) - dim(C|£;2), (6) 

and ( Jl, J2) is a partition of [n]. Conversely, for any partition {Ji,J2) of [n], setting Ji = Ji — Ei, 
i = 1,2, we see that ( Ji, J2) forms a partition of [n] — (£1 U £2), and Q is once again satisfied. 



'If A is a parity-check matrix of the code, then BG{A) is simply the corresponding Tanner graph. 



10 



N. KASHYAP 



Thus, we see that for a fixed pair of disjoint /-element subsets Ei,E2 C [n] , given a code C of 
length n, if we set Ci = C / E2\Ei and C2 = C / Ei\E2, then the minimum in ^ is equal to 

min{dim(Ci[jJ + dim(C2|j2) ■ (-^i' J'i) is apartition of [n]} + dim(C|£;J + dim(C|£;2)- 

Since the minima in ([3]) and (|5]) just differ by a constant, the minimization problem for given 
C and (£'i,i?2) is equivalent to the minimization problem (|3]l with Ci = C/E2\Ei and C2 = 
C/Ei \E2. Therefore, since ^ can be solved in time polynomial in n, so can 

The above sketch does indeed give a polynomial-time algorithm for determining A;-separations 
(J, J'^) with min{| J|, \J'^\} > /, but the complexity of the algorithm is 0(n^'"^"') for some constant 
ai that arises from the matroid intersection algorithm. Clearly, this is not very practical even for 
/ = 3 or 4, and as we shall see in the next section, these values of / come up in the implementation 
of Seymour's decomposition theory. A more efficient, albeit more involved, algorithm for finding 
2- and 3-separations is described in BSl Section 8.4]. 

As a final remark in this section, we mention that the fact that there exist algorithms for solving 
the minimization problems ©-([S]! that run in time polynomial in n neither contradicts nor sheds 
any further light on the NP-completeness results for the closely related problems considered in [,29,1 . 



4. Code Composition and Decomposition 

The code composition/decomposition methods described in this section were developed by Sey- 
mour in close analogy with a method of composing/decomposing graphs called clique-sum. In a 
clique-sum, two graphs, each containing a A'^ subgraph (A;-clique), are glued together by first pick- 
ing a A;-clique from each graph, sticking the two cliques together so as to form a single A;-clique 
in the composite graph, and then deleting some or all of the edges from this clique. A formal 
description of clique-sum can be found in [42J or in [38, p. 420]. 

Our exposition of these code composition/decomposition techniques is based on Seymour's pa- 
per ll42l . Let C and C be binary linear codes of length n and n' , respectively, and let m be an integer 
satisfying < 2m < min{n, n'}. We first define a code C \\m C as follows: if G = [gi g2 ... gn] 
and G' = [g'j^ g2 • • • g^'] are generator matrices of C and C , respectively, then C \\m C is the code 
with generator matrix 

gl • • • gn— m gn— m+1 • • • gn • • • 
••• g[ ■■■ S'm g'm+1 ••• gn' 

Thus, C \\m C is a binary linear code of length n + n' — m. This code is almost like a direct sum 
of C and C except that the two component codes overlap in m positions. Indeed, when m = 0, 
C \\m C is the direct sum of C and C. 

Codewords ofC \\m C are of the form c c', for c — ciC2 . • • , G C and c' = c'^Cg • • • , c^/ G 
C, where c \\m c' = ciC2 . . . Cn+n'-m is defined to be 

Ci for 1 < i < n — m 

Ci + c^_„+m torn — m + l<i<n 
Ci_n+rn ioTn+l<i<n + n' — m 

In other words, c \\m c' is the binary word of length n + n' — m composed as follows: the first 
n — m symbols of c \\m c' are equal to the first n — m symbols of c, the next m symbols of c \\m c' 
are equal to the coordinatewise modulo-2 sum of the last m symbols of c and the first m symbols 
of c', and the last n' — m symbols of c \\m c' are equal to the last n' — m symbols of c'. 

From C \\m C, we derive a new code, which we temporarily denote by Sm{C, C), by shortening 
at the m positions where C and C' are made to overlap. To be precise, letJ = {n — m + l,n — m-\- 
2, . . . , n}, and set Sm{C, C) = {C \\m C2) \J- Thus, Sm{C, C) is a code of length n + n' — 2m 
which, by choice of m, is greater than n and n'. Once again, note that So{C, C) = C ||o C = C(BC', 
where © denotes direct sum. 
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Figure 1 . The minimal trellis of the code C ©2 C of Example |411 



We will actually only be interested in two instances of the above construction (other than the 
direct-sum case of m = 0), one of which is presented next, and the other is introduced later in 
Section [ 



4.1. 2-Sums. The Sm{C, C) construction with m = 1 is called a 2-sum in certain special cases. 

Definition 4.1. Let C, C be codes of length at least three, such that 

(PI) ... 01 is not a codeword ofC or C^; 

(P2) 10 ... /i' not a codeword ofC or C'^. 
Then, 5i(C, C) is called the 2-sum ofC and C, and is denoted by C ©2 C. 

Note that 2-sums are only defined for codes having the properties (PI) and (P2) listed in Defini- 
tion |4?T] These properties can be equivalently stated as follows: 

(PI') ... 01 is not a codeword of C, and the last coordinate of C is not identically zero; 
(P2') 10 ... is not a codeword of C, and the first coordinate of C is not identically zero. 

As we shall see below, (PI') and (P2') are more directly relevant to an analysis of the 2-sum con- 
struction. 

Example 4.1. Let C be the [7, 3, 4] simplex code with the generator matrix given in Example \3.1\ 
As this code satisfies both (PI) and (P2) in Definition \4.1\ we can define C ©2 C. Carrying out the 
2-sum construction yields the [12, 5, 4] code C ©2 C with generator matrix 



G 



100011000111 
010101000111 
001110000111 
000000101011 
000000011101 



The minimal trellis for this code is shown in Figure\l\ It is easily (from the state-complexity profile 
of the minimal trellis, for example) that, with J = {1,2,3,4,5,6}, {J,J'^) is a 2-separation of 
C ©2 C. It may further be verified that C ©2 C has no I -separation, meaning that it is 2-connected. 
Finally, we note that by shortening C ©2 C af the 7th and 8th coordinates, and puncturing the 9th, 
11th and 12th coordinates, we obtain the simplex code C again. In other words, C is a minor of 
C ©2 C. These observations are not mere coincidences, as we shall see below. 

The dimension and minimum distance of a 2-sum C ©2 C can be related to the corresponding 
parameters of the component codes C and C Given a binary word x, we will use w{'k) to denote 
its Hamming weight, and for a code C, we will let d{C) denote the minimum distance of C. 

Proposition 4.1. Let C and C be codes for which C ©2 C can be defined. 

(a) dim(C ©2 C) = dim(C) + dim(C') - 1. 

(b) d{C ©2 C) < mm{d{C \{n}), d{C' \{1})}, where n is the length ofC. 
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Proof. Throughout this proof, n and n' denote the lengths of C and C , respectively. Also, we shall 
let X II x' denote the concatenation of binary words x and x'. 

(a) By definition, C®2C' = (C ||i C) \{n}, and so, the 2-sum is isomorphic to the subcode, £, of 
C 111 C consisting of those codewords ciC2 . . . Cn+n'-i such that c„ = 0. Since the last coordinate 
of C is not identically zero, f is a proper subcode of C ||i C , and hence, dim(C ©2 C) = dim(^) = 
dim(C 111 C) - 1. 

We claim that the direct sum C © C is in fact isomorphic (as a vector space over F2) to C ||i C . 
Indeed, consider the map (j) : C © C — > C ||i C defined via (j){c \\ c') = c ||i c', for c G C and 
c' G C . This is a homomorphism onto C ||i C , but since ... 01 ^ C and 10 ... ^ C, we have 
ker((/)) = {0}, which shows that (f) is in fact an isomorphism. 

Therefore, dim(C ©2 C) = dim(C ||i C) - I = dim(C © C) - 1, which proves the result. 

(b) Let c be a minimum-weight codeword in C \ {n}. Since cO G C, we have, by construction of 
C ©2 C, c 111 G C ©2 C Thus, d{C ©2 C) < w{c) = d{C \{n}). A similar argument shows that 
d{C®2C')<d{C'\{i}). □ 

An interesting property of 2-sums is that they behave just like direct sums under the operation of 
taking code duals. Note that by virtue of (PI) and (P2) in Definition 14. 1[ the 2-sum of C and C can 
be defined if and only if the 2-sum of their duals, and C'^ , can be defined. 

Proposition 4.2 ( |[38Tl . Proposition 7.1.20). Let C and C be codes for which C ©2 C can be defined. 
Then, 

(C ®2C't =C^ ©2 C'^- 



This result can be trivially derived from Theorem 7.3 in lITSll . but for completeness, we give a 
simple algebraic proof in Appendix El As an example, the above result implies that the matrix G 
given in Example 14. H is the parity-check matrix of the 2-sum of two copies of a [7,4] Hamming 
code. 

While the properties of 2-sums presented above are interesting, the usefulness of 2-sums actually 
stems from the following theorem of Seymour [42], which is a result analogous to Lemma |3?T] 

Theorem 4.3 (||421, Theorem 2.6). IfCi and C2 are codes of length rii and n2, respectively, such that 
C = Ci ©2 C2, then (J, J^) is an exact 2-separation ofC, for J = {1, 2, . . . , ni — 1}. Furthermore, 
Ci and C2 are equivalent to minors ofC. 

Conversely, if (J, J'^) is an exact 2-separation of a code C, then there are codes Ci and C2 of 
length I J| + 1 and | J'^l + 1, respectively, such that C is equivalent to Ci ©2 C2. 

The following corollary is a more concise statement of the above theorem, and is more in the 
spirit of Lemma im 

Corollary 4.4. A code C has an exact 2-separation iff there exist codes Ci and C2, both equivalent 
to proper minors ofC, such that C is equivalent to Ci ©2 C2. 

Another corollary ll38l Theorem 8.3.1], stated next, is a consequence of the fact that if C is a 
2-connected code, then any 2-separation of C must be exact; if not, the 2-separation would be a 
1 -separation as well, which is impossible as C is 2-connected. 

Corollary 4.5. A 2-connected code C is not 3 -connected iff there exist codes Ci and C2, both equiv- 
alent to proper minors ofC, such that C is equivalent to Ci ©2 C2. 

We will not prove Theorem 14. 3 1 in its entirety, referring the reader instead to Seymour's original 
proof, or the proof given in Oxley ll38l Section 8.3]. However, we will describe an efficient con- 
struction of the components of the 2-sum when an exact 2-separation {J,J^) of C is given, as it is 
a useful tool in code decomposition. This construction effectively proves the converse part of the 
theorem. Our description is based on the construction given in B8l Section 8.2]. 
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Let C be a code of length n and dimension k, specified by a k x n generator matrix G, and 
let (J, J^) be an exact 2-separation of C. By permuting coordinates if necessary, we may assume 
that J = {1,2,..., m} for some m < n. Let G\j and G\jc denote the restrictions of G to the 
columns indexed by J and J'^, respectively; thus, G = [G\j G\jc]. Let rank(G|j) = ki and 
rank(G| jc) = ^2; since (J, J^) is an exact 2-separation of C, we have ki + k2 = k + 1. Bring 
G into reduced row-echelon form (rref) over F2. Permuting coordinates within J and within J'^ if 
necessary, rref (G) may be assumed to be of the form 

Ik, A O B 
O O C 

where Ij, for j = ki, k2 — I, denotes the j x j identity matrix, Ais a.ki x (\J\ — ki) matrix, i? is a 
ki X (I J'^l — A;2 + 1) matrix, C is a {k2 — 1) x {\J'^\ — A;2 + 1) matrix, and the O's denote all-zeros 
matrices of appropriate sizes. As a concrete example, consider the matrix G given in Example 14.11 
which is indeed of the above form, with I J| = I J^l = 6, fci = A;2 = 3, 



G 



(V) 








1 


1 ■ 




' 


1 


1 


1 " 




A = 


1 





1 


, B = 





1 


1 


1 


and C 




1 


1 










1 


1 


1 





The fact that the submatrix 



must have rank equal to rank(G| 



k2 implies 



O B 

that B must have rank 1. Hence, B is actually a matrix with at least one non-zero row, call it b, and 
at least one non-zero column, call it b. Also, each row of B is either or identical to b, and b is 
the length-fci column vector whose ith component is a 1 if the ith row of B is equal to b, and is 
otherwise. 
Now, define 

Gi = [Ik, A h] 

and 

1 0* b 



Go 



Ik,-i C 



[Ik2 C] 



where denotes an all-zeros column-vector, and C 



h 

G 



It is not hard to show that if Ci and C2 are the codes generated by Gi and G2, respectively, 
then Ci ©2 C2 is the code generated by the matrix G in dV]). Indeed, carefully going through the 
construction, it may be verified that all the rows of G are in Ci ©2 C2. Hence, dim(Ci ©2 C2) > 
rank(G) = ki + k2-l. However, by Propositionl4~]la). we have that dim(Ci ©2C2) = dim(Ci) + 
dim(C2) — l = ki + k2 — l. Hence, dim(Ci ©2 C2) = rank(G), implying that G must be a generator 
matrix for Ci ©2 C2. 

Example 4.2. For the matrix G in Example \4.1\ we find the matrices Gi and G2 to be 



Gi 



and Go 



1 





which are indeed the generator matrices of the two simplex codes whose 2-sum is represented by 
G. 

It can also be observed that Ci and C2 are minors of Ci ©2 C2. Indeed, to obtain Ci as a minor 
of Ci ©2 C2, we proceed as follows. Let j be the index of a column of G in which the submatrix 
B has a nonzero column. For the matrix of Example 14. 1[ j could be either 10, 11 or 12. Define 
Ji = {|J| + l,|J| + 2,...,|J| + A;2-l},and J2 = -{JiVJ{j}). Then, Ci = (Ci ffi2C2) \Ji/ J2. 
To obtain C2 as a minor, let j' be the index of a row of G in which the submatrix B has a nonzero 
row. For the matrix of Example 14. 1[ j' could be either 1, 2 or 3. The j'th column of G is of the 
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form [0 ... 1 ... 0]*, the single 1 being the j'th entry. Define J[ = {1, 2, . . . , fci} - {j'} and 
= {fci + 1, fci + 2, . . . , I J|}. Then, C2 = (Ci 02 C2) \ J{/J2- Proofs of these statements just 
involve consistency checking, so are left as an easy exercise. 

In summary, the procedure described above takes as input a.k x n generator matrix G for C, and 
an exact 2-separation (J, J^) of it, and produces as output a permutation vr of the coordinates of C, 
and the generator matrices of two codes Ci and C2, such that C = 7r(Ci 02^2). The codes Ci and 
C2 are both equivalent to proper minors of C. The entire procedure can be carried out in 0{k'^n) 
time, which is the run-time complexity of bringing a. k x n matrix to reduced row-echelon form 
via elementary row operations. All other parts of the procedure can be performed in 0{n) time; 
for example, since (J, J'^) is given, it only takes 0(n) time to find the permutation, vr"^, that takes 
rref (G) to the matrix G in ©. 

A straightforward combination of Lemma 13.11 and Corollary 14.51 yields the following theorem, 
which illustrates the utility of the matroid-theoretic tools presented so far. 

Theorem 4.6 ([38|, Corollary 8.3.4). Every code that is not 3-connected can be constructed from 
3 -connected proper minors of it by a sequence of operations of coordinate permutation, direct sum 
and 2-sum. 

The decomposition of a code via direct sums and 2-sums implicit in the above theorem can be 
carried out in time polynomial in the length of the code. This is due to the following two facts: 

(a) as described in Section [3l there are polynomial-time algorithms for finding 1- and 2- sepa- 
rations in a code, if they exist; and 

(b) given an exact 2-separation of a code C, there is a polynomial-time procedure that pro- 
duces codes Ci and C2, both equivalent to proper minors of C, and a permutation vr of the 
coordinate set of C, such that C = 7r(Ci 02 C2). 

However, direct sums and 2-sums are not enough for our purposes, nor were they enough for 
Seymour's theory of matroid decomposition. Seymour also had to extend the graph-theoretic tech- 
nique of 3-clique-sum to matroids (in fact, to binary matroids only). The corresponding operation 
on binary matroids is called 3-sum. 

4.2. 3-Sums. The special case of the S^{C, C) construction called 3-sum is somewhat more com- 
plex in definition than the 2-sum. Recall that for a binary word x, w{^) denotes its Hamming 
weight. 

Definition 4.2. Let C, C be codes of length at least seven, such that 

(Al) no codeword of C or is of the form ... Ox, where x is a length-3 word with w{:x.) G 
{1,2}; 

(A2) no codeword ofC or C is of the form x ... 0, where x is a length-3 word with w{x.) € 

{1, 2}; and 
(A3) . . . 0111 G C and 1110 . . . G C. 

Then, S3{C, C) is called the 3-sum ofC and C, and is denoted by C 03 C. 

It is perhaps worth commenting upon the use of the terms "2-sum" and "3-sum" to denote codes 
of the form Sm{C,C') for m = 1 and m = 3, respectively. The nomenclature stems from the 
analogy with the A;-clique-sum of graphs, wherein two graphs are glued along a fc-clique. Note that 
a 2-clique is a single edge (hence m = 1) and 3-clique is a triangle of three edges (hence m = 3). 
This also explains why we do not consider an operation of the form 82(0, C). 

3-sums are only defined for codes having the properties (A1)-(A3) listed in the above definition. 
It is obvious that an equivalent statement of (A1)-(A3) is the following: 

(B 1) ... 0111 is a minimal codeword of C, and has no nonzero codeword supported entirely 
within the last three coordinates of ; and 
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Figure 2. The minimal trellis of the code C ©3 C of Example |431 

(B2) 1110 ... is a minimal codeword of C, and C'"*" has no nonzero codeword supported en- 
tirely within the first three coordinates of C'"*". 

In fact, (Bl) and (B2) above are exact translations of the matroid-theoretic language used by Sey- 
mour in his definition of 3-sum [42 1. Another equivalent way of expressing these conditions is the 
following: 

(Bl') 0...0111isa minimal codeword of C , and the restriction of C onto its last three coordinates 
is {0, 1}^; and 

(B2') 1110 ... is a minimal codeword of C, and the restriction of C onto its first three coordi- 
nates is {0, 1}^. 

The equivalence of (Bl) and (Bl') is a consequence of the easily verifiable fact that if ... Dili G 
C, then has no nonzero codeword supported entirely within the last three coordinates of C"*- if 
and only if all possible 3-bit words appear in the last three coordinates of C. The equivalence of 
(B2) and (B2') is analogous. It follows immediately from (Bl') and (B2') that C©3C' can be defined 
only if min{dim(C),dim(C')} > 3 and max{d{C),d{C')} < 3. 

Example 4.3. Let C and C be the [7,4] Hamming codes given by the generator matrices G and G', 
respectively, below. 

1000011" 
10 10 1 

10 110 ■ 

1 1 1 1 

C and C satisfy the conditions in Definition \4.2\ so their 3-sum can be defined. The code C ©3 C 
works out to be the [8, 4, 4] extended Hamming code with generator matrix 

"10010011" 
_ 10 10 10 1 

00110110 ■ 

1 1 1 1 

The minimal trellis of this code is shown in Figure^ 

With J = {1, 2, 3, 4}, (J, J'^) is an exact 3-separation ofC ©3 C. It may be verified that, in fact, 
X{C ©3 C) = 3. Furthermore, puncturing any coordinate ofC ©3 C yields a [7, 4] Hamming code. 
Thus, C and C are (up to code equivalence) minors ofC ©3 C. 

The dimension and minimum distance of C©3C' can be related to C and C in a manner analogous 
to Proposition 14. 1 I for 2-sums. 

Proposition 4.7. For codes C and C of length n and n' for which C^sC can be defined, we have 

(a) dim(C©3C') = dim(C) + dim(C') - 4. 

(b) d{C®3C') < mm{d{C \{n - 2, n - 1, n}, d{C' \{1, 2, 3})}. 




1 1 1 
10 10 10 
110 10 
11110 
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Proof. We will only prove (a) as the proof of (b) is analogous to the proof of Proposition 14. li b). 
Let X II x' denote the concatenation of binary sequences x and x'. 

We prove the proposition by first showing that dim(C Ha C) = dim(C) + dim(C') — 1, and then 
showing that dim(C ©3 C) = dim(C II3 C) — 3. The first of these equalities follows directly from 
the observation that the mapping 

(/.(c II c') = c II3 c', cgC,c'gC', 

defines a homomorphism from the direct sum C © C onto C II3 C, with dim(ker((/))) = 1; indeed, 
ker(0) consists of the two words and . . . 0111 || 1110 ... 0. 

To prove that dim(C ©3 C) = dim(C II3 C) — 3, we observe that C ©3 C is isomorphic to the 
subcode, 8, of C II3 C consisting of those codewords C1C2 . . . c„+„'_3 such that c„_2C„_ic„ = 000. 
Therefore, dim(C ©3 C) = dim(£'). 

Since the restriction of C onto its last three coordinates is {0, 1}^ (property (Bl')), and the restric- 
tion of C onto its first three coordinates is {0, 1}^ (property (B2')), the restriction of C \\z C onto 
its (n — 2)th, {n — l)th and nth coordinates is also {0, 1}^. Therefore, dim.{£) = dim(C II3 C) — 3, 
and hence, dim(C ©3 C) = dim(C II3 C) — 3, which completes the proof of the proposition. □ 

An important difference between 2-sums and 3-sums is that the result of Proposition 14.21 does 
not directly extend to 3-sums. The reason for this is that for codes C and C satisfying (A1)-(A3) 
in Definition 14.21 the 3-sum C"'"©3C'"'" cannot even be defined. Indeed, while (Al) and (A2) are 
invariant under the operation of taking duals, (A3) is not — if . . . 0111 G C, then . . . 0111 ^ C"*-. 
To determine the dual of a 3-sum, we need to define a "dual" operation, namely the 3-sum. 

Definition 4.3. Let C, C be codes of length at least seven, such that 

(Al') no codeword of C or is of the form ... Ox, where x is a length-3 word with w{x.) G 
{1,2}; 

(A2') no codeword ofC or C is of the form x ... 0, where x is a length-3 word with w{x.) € 

{1, 2}; and 
(A3') . . . 0111 G and 1110 . . . G C'"^. 

The 3-sum ofC and C, denoted by C ©3 C, is defined as 

CffisC = C©3C^, 
whereC = C IJ (0 . . . 0111 + C) andC' = C' U (mO • • • + C). 

Note that (Al') and (A2') are identical to (Al) and (A2), respectively. To ensure that the above 
definition can in fact be made, it must be verified that the 3-sum C ©3 C can actually be defined for 
codes C and C satisfying (A1')-(A3'). So, let C and C be codes satisfying (A1')-(A3'). We need to 
verify that C and C' satisfy (A1)-(A3) in Definition 1421 

By their very definition, C and C satisfy (A3). Furthermore, since C C and C C C'^ ", we 

see that no codeword of C is of the form ... Ox as in (A 1), and no codeword of C is of the 
form xO ... as in (A2). Finally, if ... Ox G C for some length-3 word x with w{'k) G {1, 2}, 
then since ... x ^ C by (Al'), it must be that ... x is in ... 0111 + C. But in this case, 
0. . .Ox G C, where x = 111 + x. So, w{x.) G {1,2} as well, which is impossible by (Al'). 
Therefore, C cannot contain any word of the form ... Ox as in (A 1). By analogous reasoning, C 
cannot contain any word of the form x ... as in (A2). We have thus verified that C and C satisfy 
(A1)-(A3), and so C ©3 C' can be defined. 

Note that (A3') implies that . . . 0111 ^ C and 1110 . . . ^ C, and hence, dim(C) = dim(C) + l 
and dim(C') = dim(C') + 1. Therefore, by virtue of Proposition 14.71 we have the following result. 

Lemma 4.8. dim(Cffi3C') = dim(C) + dim(C') - 2. 
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The 3-sum is the dual operation to 3-sum, in a sense made precise by the proposition below, the 
proof of which we defer to Appendix El This result is stated, without explicit proof, in 1251 p. 316] 
and [48, p. 184]; Truemper [I25II48I refers to 3-sum and 3-sum as A-sum and y-sum, respectively. 
The result can also be derived from Theorem 7.3 in llTSll . 

Proposition 4.9. Let C and C be codes for which C ©3 C can be defined. Then, 

(C©3C')^ =C^©3C'^• 
It follows from the last result that a code that is expressible as a 3-sum can also be expressed as 
a 3-sum. Indeed, if C = Ci ©3 C2, then by the above proposition, = ©3 C^. The latter, by 
definition, is Cj^©3C^, and so again taking duals and using the above proposition, we obtain 

c = (ci®,ciy = (ciY ©3 (c^y . 

We record this as a corollary to Proposition 14.91 
Corollary 4.10. IfCi®'iC2 can be defined, then 

Clffi3C2 = {ciY ©3 (C^Y ■ 



Having presented some of the simpler properties of 3-sums, we next state a highly non-trivial 
result of Seymour that illustrates how 3-sums are to be used. The statement of this result is the 
3-sum analogue of Theorem 14. 31 but there are some important differences between the two that we 
will point out after stating the result. 

Theorem 4.11 (1421, Theorems 2.9 and 4. 1). IfCi and C2 are codes of length ni and n2, respectively, 
such that C = Ci ©3 C2, then (J, J^) is an exact 3-separation of C for J = {1, 2, . . . , ni — 3}. 
Furthermore, ifC is 3-connected, then Ci and C2 are equivalent to proper minors ofC. 

Conversely, if {J, J^) is an exact 3-separation of a code C, with min{| J|, \ J'^\} > 4, then there 
are codes Ci and C2 of length | J | + 3 and | J'^ | + 3, respectively, such that C is equivalent fo Ci ©3 C2. 

A couple of key differences between the statements of Theorems 14. 3 1 and |4. 1 1 1 must be stressed. 
For a code to be expressible as a 2-sum, it is sufficient that there exist an exact 2-sepaiation. How- 
ever, to make the analogous conclusion about 3-sums, Theorem l4.11l not only asks for the existence 
of an exact 3-separation (J, J^), but also adds the additional hypothesis that min{| J|, \ J'^\} > 4. 
We will have more to say about this a little later. 

There is a second major difference between the statements of the two theorems. Theorem 14.31 
states that if C = Ci ©2 C2, then Ci and C2 are always minors of C, up to coordinate permutation. 
However, when C = Ci ©3 C2, Theorem 14. 1 1 1 imposes the condition that C be 3-connected in order 
to conclude that Ci and C2 are equivalent to minors of C. If the 3-connectedness requirement for C 
is dropped, the conclusion does not hold in general, as the following example shows. 

Example 4.4. Take Ci to be the [7, 4, 1] code with generator matrix 

"1000000" 
_ 10 10 1 
^ ~ 110 10 ' 
1110 10 
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Figure 3. The minimal trellis of the code C = Ci ©3 C2 of Example I4H 

and let C2 be the [7, 4, 3] Hamming code with generator matrix 

"1000110" 
_ 10 10 1 

0010011 ■ 

1 1 1 1 

These codes satisfy (A1)-(A3) of Definition \4.2\ and their 3-sum, Ci ©3 C2, is the [8,4, 1] code C 
generated by 

'10000000" 
10 10 11 

00110101 ■ 

1 1 1 1 

The minimal trellis of this code is shown in Figure\3l Note that C is not 3-connected. In fact, it 
is not even 2-connected — it has a 1-separation (J, J^) with J = {1}. Now, Ci can be obtained 
as a minor of C by puncturing C at the last coordinate. However, C2 is not a minor of C, since 
puncturing C at any coordinate does not yield a [7, 4, 3] code, and shortening always yields a code 
of dimension less than 4. 

We mention in passing that if C = Ci ©3 C2, then the fact that Ci and C2 are equivalent to 
minors of C whenever C is 3-connected is far more difficult to prove (see 1421 Section 4]) than the 
corresponding part of Theorem 14. 31 

We observed above that the mere existence of an exact 3-separation is not enough for Theo- 
rem |4?TT] to conclude that a code C is expressible as a 3-sum; the 3-separation (J, J^) must also 
satisfy min{| J|, \J^\} > 4, i.e., must not be minima^. It is also implicit in the statement of Theo- 
rem l4.11l that the existence of a non-minimal 3-separation is a necessary condition for a code to be 
a 3-sum. Indeed, if C = Ci ©3 C2, then Ci and C2 must each have length at least 7, as per Defini- 
tion 14.21 So, with J = {1, 2, . . . , ni — 3} as in the statement of Theorem l4.1 1[ it must be true that 
I J| > 4, and similarly, \J'^\ > 4. 

The following definition allows for a compact statement of a corollary to Theorem 14.111 along 
the lines of Corollary 14.51 

Definition 4.4. A 3-connected code is internally 4-connected if all its 3 -separations are minimal. 

Internal 4-connectedness is a notion that lies properly between 3 -connectedness and 4-connectedness 
— a 3-connected code that is not 4-connected can be internally 4-connected. Note that any 3- 
separation in a 3-connected code must be exact, and so we can state the following corollary to 
Theorem 14. Ill 

Corollary 4.12. A 3-connected code C is not internally 4-connected iff there exist codes Ci and C2, 
both equivalent to proper minors ofC, such that C is equivalent to Ci ©3 62- 



The definition of a minimal fe-separation is given immediately after Definition |3.1| 
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As in the case of 2-sums, we provide an efficient construction of the components of the 3-sum 
when an exact, non-minimal 3-separation (J, J^) of C is given. This construction furnishes a proof 
of the converse part of Theorem 14.111 Our description is based loosely on the constructions given 
in t48, Section 8.3] and 125]. 

Let C be a code of length n and dimension k, specified by a A; x n generator matrix G, and let 
(J, J'^) be an exact 3-separation of C, with | J| > 4 and \J'^\ > 4. By permuting coordinates if 
necessary, we may assume that J = {1,2,..., m} for some m such that 4 < m < n — 4. Let 
G\j and G| jc denote the restrictions of G to the columns indexed by J and J^, respectively; thus, 
G = [G\j G\jc]. Let rank(G[ j) = ki and rank(G| jc) = k2; since (J, J^) is an exact 3-separation 
of C, we have ki + k2 = k + 2. Note that ki < k implies that k + k2 > ki + k2 = k + 2, ?,o that 
k2 > 2; similarly, ki > 2. 

Bring G into reduced row-echelon form (rref) over F2. Permuting coordinates within J and 
within if necessary, rref (G) may be assumed to be of the form 

A O B 

o O 4,_2 c 

where Ij, for j = fei, ^2 — 2, denotes the j x j identity matrix, Ais aki x ([ J[ — ki) matrix, B isa. 
ki X (I J*^! — A;2 + 2) matrix, G is a (/c2 — 2) x {\J'^\ — k2 + 2) matrix, and the O's denote all-zeros 
matrices of appropriate sizes. As a concrete example, consider the matrix G given in Example [ 
which is indeed of the above form, with I J| = I J'^l = 4, /ci = A;2 = 3, 



and G = [ 1 1 1 



G 



(8) 





' 1 " 




' 


1 


1 " 


A = 


1 


, B = 


1 





1 




1 




1 


1 






The fact that the submatrix 



must have rank equal to rank(G|^ 



k2 implies 



O B 

that B must have rank 2. Hence, B has two linearly independent rows, call them x and y, which 
form a basis of the row-space of B. In particular, each row of B is either 0, x, y or x + y. 
Now, define the (ki + l) x (| J| + 3) matrix 

4i A D 
1 



ith row of D 



where D is a ki x 3 matrix whose ith row is defined as 

000 if ?th row of S is 

001 if ith row of i? is x 
010 if ith row of i? is y 
100 if ith row of i? is x + y; 

and the bottom row of Gi, represented by [0 1], is simply . . . 0111. 
Next, define the (A;2 + 1) x ([ + 3) matrix 

h O X 

O Ik,-2 C 



Go 



[Ik2 + 1 C"] 



where 



X 



x + y 

y 

X 



and G" 



X 
C 



A straightforward verification yields that if Ci and C2 are the codes generated by Gi and G2, 
respectively, then dim(Ci) = ki + 1, dim(C2) = A;2 + 1> and Ci, C2 satisfy properties (A1)-(A3) in 
Definition l4.2[ so Ci ©3C2 can be defined. The construction of Gi and G2 above is carefully crafted 
to ensure that all the rows of G are in Ci ©3 C2. Hence, dim(Ci ©3 C2) > rank(G) = ki + k2 - 2. 
However, by Proposition 14.71 we have that dim(Ci ©3 C2) = dim(Ci) + dim(C2) — 4 = fci + /c2 — 2. 
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Hence, dim(Ci ©3 C2) = rank(G), implying that G must be a generator matrix for Ci ©3 €2- Note 
that according to Theorem 14. Ill if C is 3-connected, then Ci and C2 are equivalent to proper minors 
ofC. 

The procedure described above can be formalized into an algorithm that takes as input a. k x n 
generator matrix G for C, and an exact, non-minimal 3-separation (J, J'^) of it, and produces as 
output a permutation vr of the coordinates of C, and the generator matrices of two codes Ci and C2, 
such that C = 7r(Ci ©3 C2). The codes Ci and C2 are both equivalent to proper minors of C. This 
procedure can be carried out in 0{k'^n) time for the same reasons as in the 2-sum case. 

For the matrix G in Example l4.3[ we find the matrices Gi and G2 to be 

1000011" 
10 10 1 
10 110 ' 
1 1 1 1 

which are indeed generator matrices of the two Hamming codes whose 3-sum is represented by G. 

It must be pointed out that Theorem l4.11l also holds when the 3-sums in its statement are replaced 
by 3-sums. This is a consequence of Proposition 13.21 which shows that (J, J^) is a 3-separation of 
a code C iff it is a 3-separation of the dual code C^. Hence, applying Theorem 14. Ill to C"*-, and 
dualizing via Proposition |4.9[ we see that the 3-sums in the statement of Theorem 14.1 1! can be 
replaced by 3-sums. In particular, we also have the following corollary to Theorem 14. Ill 

Corollary 4.13. A 3-connected code C is not internally 4-connected iff there exist codes Ci and C2, 
both equivalent to proper minors ofC, such that C is equivalent to Ci ©3 €2- 

Putting together Theorem 14. 6l with 1 4.121 and 14.131 we obtain the following theorem, which sum- 
marizes the code decomposition theory presented up to this point. 

Theorem 4.14. A binary linear code either is 3-connected and internally 4-connected, or can be 
constructed from 3-connected, internally 4-connected proper minors of it by a sequence of opera- 
tions of coordinate permutation, direct sum, 2-sum and 3-sum (or 3-sum). 

The decomposition of Theorem 14.141 can be carried out in time polynomial in the length of the 
code, since 

(a) the decomposition of Theorem l4.6l can be carried out in polynomial time; 

(b) as mentioned in Section [3l there are polynomial-time algorithms for finding non-minimal 
3-separations (i.e., 3-separations (J, J'^) with min{| J|, | J^|} > 4) in a code, if they exist; 
and 

(c) there is a polynomial-time procedure that, given an exact, non-minimal 3-separation of a 
3-connected code C, produces codes Ci and C2, both equivalent to proper minors of C, and a 
permutation vr of the coordinate set of C, such that C = tt(Ci ©3 C2) or, via Corollary 14. 101 

C = 7r(Ciffi3C2). 

Note that the theorem does not guarantee uniqueness of the code decomposition. 

4.3. Code-Decomposition Trees. A binary tree is a convenient data structure for storing a decom- 
position of a code via direct sums, 2-sums, and 3-sums. Recall that a proper (or full) binary tree is 
a rooted tree such that every node of the tree has either zero or two children. We will drop the ad- 
jective "proper" as proper binary trees are the only kind of binary trees we are interested in. A node 
without any children is called a leaf. Each non-leaf node has two children, and we will distinguish 
between the two, calling one the left child and the other the right child. 

Let C be a binary linear code. A code-decomposition tree for C is a binary tree T defined 
as follows. Each node V of T stores a triple (V.COde,v.perm,V.Sum), where v.code is a binary 
linear code, v.perm is either NULL or a permutation of the coordinate set of V.COde, and v.sum 
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G {0, ©, ©2, ©3, ©3 }• For each node V of T, the triple (v.code,v.perm,v.sum) must adhere to 
the following rules: 

(Rl) if V is the root node, then v.code is the code C itself; 
(R2) v.perm = NULL iff v is a leaf; 
(R3) v.sum = iff V is a leaf; 
(R4) if V is a non-leaf node, then 

(i) Ichild.code and rchild.COde are proper minors of V.COde; and 

(ii) the permutation v.perm applied to the sum 

(Ichild.code) v.sum (rchild.code) 

yields V.COde, 

where Ichild and rchild above respectively refer to the left and right children of V. 

In particular, (R4) ensures that for any node V other than the root node in the tree, V.COde is a proper 
minor of C. 

We will identify the leaves of any code-decomposition tree with the codes that they store. The- 
orem 14.141 guarantees that each code C has a code-decomposition tree in which each leaf is 3- 
connected and internally 4-connected. Such a code-decomposition tree will be called complete. A 
complete code-decomposition tree for C can be constructed in time polynomial in the length of the 
code C, since the decomposition of Theorem l4.14l can be carried out in polynomial time. Note that 
if C itself is 3-connected and internally 4-connected, then there is exactly one code-decomposition 
tree for it, which is the tree consisting of the single node C — or more precisely, the single node 
that stores (C, NULL, 0). 

Finally, a code-decomposition tree is called 3-homogeneous (resp. 3-homogeneous) if for each 
non-leaf node V in the tree, if v.sum G {©,©2, ©3} (resp. V.Sum G {©,©2, ©3 })■ Thus, in a 
3-homogeneous (resp. 3-homogeneous) tree, no 3-sums (resp. 3-sums) are used. It follows from 
Corollaries 14. 12| and |4. 13| that if a code-decomposition tree has a node of the form V = (C, vr, ©3 ), 
with C being 3-connected, then the subtree with V as a root can be replaced with a different subtree 
in which the root node is v' = (C, vr', ©3). Therefore, every code has a complete, 3-homogeneous 
(or 3-homogeneous) code-decomposition tree, which again can be constructed in time polynomial 
in the length of the code. 

Having described in detail Seymour's decomposition theory in the context of binary linear codes, 
we now turn to some of its applications. This theory mainly derives its applications from families 
of codes that are minor-closed, and such families form the subject of the next section. 

5. Minor-Closed Families of Codes 

A family of binary linear codes is defined to be minor-closed if for each C £ ^, every code 
equivalent to a minor of C is also in C Note that this definition automatically implies that a minor- 
closed family, (L, of codes is closed under code equivalence, i.e., if C G (i, then all codes equivalent 
to C are also in 

A non-trivial example of a minor-closed family is the set of all graphic codes, since any minor 
of a graphic code is graphic. We will encounter other examples of minor-closed families (regular 
codes and geometrically perfect codes) further on in this paper. We mention in passing another 
interesting example of such a family — codes of bounded trellis state-complexity. Recall from 
Section |3] that the state-complexity profile of a length-n code C is defined to be the vector s(C) = 
(so(C), . . .,Sn{C)), where Si{C) = dim(C|j) + dim(C| jc) - dim(C) for J = [j] c [n]. Define 
•Smax(C) = maxjg[„] S j(C). For a fixed integer w > 0, let TCu, denote the family of codes C such 
that there exists a code C equivalent to C with Smax(C') < w. Then, TC^ is minor-closed [321, 
ir33.l . A similar statement holds for the family of codes that have a cycle-free normal realization (cf. 
|[T9]| ) whose state-complexity is bounded by w. 
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A general construction of minor-closed families is obtained by fixing a collection T of codes, 
and defining Cjf to be the set of all codes C such that no minor of C is equivalent to any C € T . 
As an example, let T = {Ui ,C{K^)^ ,C{K'i^'i)^}, where n^ is the [7,4] Hamming cod^. 
By Theorem 12. 1[ £jr in this case is precisely the family of graphic codes. It is clear that £jr is a 
minor-closed family for any fixed T . In fact, every minor-closed family can be obtained in this 
manner. Indeed, let £ be a minor-closed family of codes. A code 2) is said to be an excluded minor 
of £ if P ^ £, but every proper minor of V is in <t. It is not hard to verify that a code C is in iff 
no minor of C is an excluded minor of <t. Theorem 12.11 is an example of such an excluded-minor 
characterization, and we will see more such examples (Theorems 15. 3l and l6. 21 ) further below. Thus, 
taking to be the collection of all excluded minors of (t, we have that = (tjr. A tantalizing 
conjecture of Robertson and Seymour asserts that any minor-closed family C of binary linear codes 
has only finitely many excluded minors ll23l Conjecture 1.2]. 

Let C be a minor-closed family of codes. By Theorem l4.14[ every C G ^ can be constructed from 
3-connected, internally 4-connected codes in £ using direct sums, 2-sums, and 3- or 3-sums. The 
converse need not always be true, i.e., it is not necessarily true that if a code C has a decomposition 
via direct sums, 2-sums, and 3- or 3-sums into codes in (L, then C € Of course, the converse does 
hold if C is also closed under the operations of direct sum, 2-sum, 3-sum and 3-sum. As usual, € 
is defined to be closed under direct sum (resp. 2-sum, 3-sum, 3-sum) if for any pair of codes in C, 
their direct sum (resp. 2-sum, 3-sum, 3-sum, if it can be defined) is also in We summarize this in 
the following proposition. 

Proposition 5.1. Let (t be a minor-closed family of codes that is also closed under the operations 
of direct sum, 2-sum, 3-sum and 3-sum. Then, the following are equivalent for a code C. 

(i) C is in (J. 

(ii) The leaves of some code-decomposition tree for C are in C 

(iii) The leaves of some complete, 3-homogeneous or 3-homogeneous code-decomposition tree 
for C are in C 

(iv) The leaves of every code-decomposition tree for C are in C 

Proof, (i) implies (iv) since the leaves of any code-decomposition tree of C are minors of C, and d 
is minor-closed. The implications (iv) =^ (iii) and (iii) =^ (ii) are trivial, (ii) implies (i) since C is 
closed under direct-sums, 2-sums, 3-sums and 3-sums. □ 

Since a complete code-decomposition tree of any code C can be constructed in time polynomial 
in the length of C, we have the following corollary to the above result. 

Corollary 5.2. Let <tbe a minor-closed family of codes that is also closed under the operations of 
direct sum, 2-sum, 3-sum and 3-sum. Then the following are equivalent statements. 

(i) It can be decided in polynomial time whether or not a given code C is in ^. 

(ii) It can be decided in polynomial time whether or not a given 3-connected, internally 4- 
connected code C is in <t. 

The first major application of results such as the above — the application which was in fact the 
motivation for Seymour's matroid decomposition theory — relates to totally unimodular matrices. 
A real matrix A is said to be totally unimodular if the determinant of every square submatrix of 
A is in {0, 1, —1}. In particular, each entry of a totally unimodular matrix is in {0, 1, —1}. Such 
matrices are of fundamental importance in combinatorial optimization and network flow problems, 
because total unimodularity is closely related to integer linear programming 1*28^1. 

A binary matrix is defined to be regular if its I's can be replaced by itl's in such a way that the 
resulting matrix is totally unimodular. Consequently, a binary linear code is defined to be regular if 
it has a regular parity-check matrix. It turns out that for a regular code, every parity-check matrix is 



'From now on, Ti.7 will always denote the [7, 4] Hamming code. 
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regular P8 l Corollary 9.2. 11]. Furthermore, given a regular binary matrix B, there is a polynomial- 
time algorithm that converts i? to a totally unimodular matrix by assigning signs to the I's in 
B HSl Corollary 9.2.7]. Thus, regular codes form the key to understanding total unimodularity. 
The following theorem, due to Tutte [49], provides an elegant excluded-minor characterization of 
regular codes. 

Theorem 5.3. A binary linear code is regular iff it does not contain as a minor any code equivalent 
to the [7,4] Hamming code or its dual. 

It follows from the theorem that the family of regular codes, which we will denote by ^R, is 
minor-closed, since it is of the form (^jf for = {Hj, Tif }. Furthermore, is closed under the 
taking of code duals, i.e., the dual of a regular code is also regular. This is because a code C contains 
Hj as a minor iff its dual C"*- contains Hj- as a minor. It can further be shown |[38l p. 437] that 9\ is 
closed under the operations of direct sum, 2-sum, 3-sum and 3-sum. 

Note that by Theorem 12.11 9^ contains the family of graphic codes, and hence, the family of 
co-graphic codes as well, which are codes whose duals are graphic. Using a long and difficult 
argument, Seymour B2l proved that the 3-connected, internally 4-connected codes in ^R are either 
graphic, co-graphic, or equivalent to a particular isodual code that he called Riq, which is neither 
graphic nor co-graphic. 

Theorem 5.4 ([38], Corollary 13.2.6). IfC is a 3-connected, internally 4-connected regular code, 
then C is either graphic, co-graphic, or equivalent to Riq, which is the [10, 5, 4] code with parity- 
check matrix 
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Thus, Seymour's decomposition theory shows that any regular code (and so by assignment of 
signs, any totally unimodular matrix) can be constructed by piecing together — via direct sums, 
2-sums, and 3-sums or 3-sums — graphic codes, co-graphic codes, and codes equivalent to Riq. 
Also, membership in the family of regular codes can be decided in polynomial time. Indeed, as 
mentioned at the end of Section |2l there are polynomial-time algorithms for deciding whether or 
not a given code is graphic. Given mm x n parity-check matrix H for a code, a generator matrix 
for the code can be computed using elementary row operations on H in 0{m?n) time. Thus, the 
dual of a code can be determined in polynomial time, and hence it can be decided in polynomial 
time whether or not a given code is co-graphic. Hence, from Corollary 15.2! and Theorem I5.4[ it 
follows that there is a polynomial-time algorithm for determining whether or not a given code is 
regular. The best such algorithm known is due to Truemper [47], which runs in 0((m + n)'^) time. 
Truemper's algorithm is also based on Seymour's decomposition theory, but it implements a highly 
efficient procedure for carrying out the decomposition. 

While the application of Seymour's decomposition theory to regular codes is interesting, it is not 
very useful, perhaps, from a coding-theoretic perspective. However, in the next section, we give an 
application that should be of some interest to a coding theorist. 

6. Application: ML Decoding 

The recent work of Feldman, Wainwright and Karger ifTSl shows that ML decoding of a binary 
linear code C over a discrete memoryless channel can be formulated as a linear program (LP). Recall 
that the ML decoding problem is: given a received word y at the channel output, find a codeword 
X € C that maximizes the probability, Pr[y|x], of receiving y conditioned on the event that x 
was transmitted. As observed by Feldman et al. , under the assumption of a discrete memoryless 
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channel, given a received word y = yiy2 ■ ■ ■ Un, the problem of determining argmax^gc Pr[y|x] 
is equivalent to the problem of finding arg min^^c (7) where 7 = (71, 72, . . . , 7n) is given by 

and (• , •) is the standard inner product on R". Here, for the inner product (7, x) to make sense, 
a binary codeword x = X1X2 • • • x„ G C is identified with the real vector {xi,X2, ■ ■ ■ , x„) G 
{0, 1}" C M". 

The above formulation shows ML decoding to be equivalent to the minimization of a linear 
function over a finite set C C {0, 1}". Let P{C) be the codeword polytope of C, i.e., the convex 
hull in M" of the finite set C. It can be shown that the set of vertices of P{C) coincides with C. 
The key point now is that over a polytope P, a linear function (j) attains its minimum value (pmin = 
min{(/)(x) : x G P} at a vertex of P. In particular, minxgc(7)X) = minxgp(c)(7, x), Thus, 
ML decoding is equivalent to finding a vertex of the polytope P{C) that achieves minxgp(c) (7, x), 
which is a classic LP. 

However, ML decoding of an arbitrary code is known to be NP-hard ||4l. So, in general, solving 
the above LP over the codeword polytope is also NP-hard. A strategy often followed in such a 
situation is to "relax" the problem. The idea is to look for a polytope that contains the code as a 
subset of its vertex set, but which has some property that allows an LP defined over it to be solved 
more easily. Such a polytope is called a relaxation of the codeword polytope P{C). 

A certain relaxation of the codeword polytope has received much recent attention lfT5ll . ll55ll . P4ll . 
This is the polytope which, given a code C of length n, and a subset H C C^, is defined as 

Q{H) = n p(h^), 

where P(h^) is the codeword polytope of the code = {c G F2 : (h, c) = (mod 2)}. Note 
that since C C h-^ for any h G C^, we have that P{C) C flhei? P{^^) = Q{H) for any H C C^. 
In particular, P{C) C Q(C^) for any code C. 

For any H C C^, the polytope Q{H) contains C as a subset of its vertex set, V{H). This is 
because C C Q{H) n {0, 1}", and since Q{H) is contained within the n-cube [0, 1]", we also 
have Q{H) n {0, 1}" C V{H). Thus, Q{H) is indeed a relaxation of P{C). Consequently the 
LP minxgQ(//)(7, x), where 7 is the vector defined via Q, constitutes a relaxation of the LP that 
represents ML decoding. 

Now, any standard LP-solving algorithm requires that the LP to be solved have its constraints 
be represented via linear inequalities. The advantage of using the relaxation Q{H) is that there 
is a convenient such representation of the constraint x G Q{H). The polytope Q{H) can also be 
expressed as (see e.g. [|15l Theorem 4] or [55, Lemma 26]), 

Q{H) = fl n(supp(h)), (10) 
where for S c [n], 11(5) denotes the polyhedron 

n(5)= fl I (xi,...,xO e [0,ir : ^x, - x,<\J\-l\. (11) 

JCS I jeJ i€S\J I 

J| odd ^ ' 

The efficiency of a practical LP solver (like, say, the simplex or ellipsoid algorithm) depends on the 
size of the LP representation, which is proportional to the number of variables and linear inequalities 
forming the constraints. If H above consists of the rows of a parity-check matrix of a low-density 
parity-check (LDPC) code, then the representation of Q{H) given by ([TOl)- (fTTI) has size linear 
in the codelength n. So, the ellipsoid algorithm, for example, would be guaranteed to solve the 
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LP minxgQ(/f)(7,x) in time polynomial in n. However, as we explain next, this LP is no longer 
equivalent to ML decoding in general. 

Let J{H) = V{H) n {0,1}" denote the set of integral vertices {i.e., vertices all of whose 
coordinates are integers) of Q{H). We noted above that C C J{H). If is a spanning subset 
(over F2) of C"*-, so that the vectors in H form (the rows of) a parity-check matrix of C, then we in 
fact have C = J{H). This is because if x € {0, 1}" is not in C, then x ^ h"*- for some h G if, 
and hence, x ^ P{h.^) D Q{H). The polytope Q{H) in this case is the "fundamental polytope" 
of Vontobel and Koetter 155 ] (or equivalently, the "projected polytope" Q of Feldman et al. |[T5l 
p. 958]). The fact that C = J{H) for such a polytope Q{H) implies that the polytope has the 
following "ML certificate" property ITSl Proposition 2]: if the LP minxgQ(//) (7, x), where 7 is 
the vector defined via attains its minimum at some x G J{H), then x is guaranteed to be the 
ML codeword. However, it is possible that the above LP attains its minimum at some non-integral 
vertex x € V{H) — J{H), in which case decoding via linear programming over Q{H) fails. The 
non-integral vertices of Q{H) are called "pseudocodewords". 

It is naturally of interest to know when a code C has a fundamental polytope Q{H) (for some 
spanning subset HofC^) without pseudocodewords. For such codes, ML decoding can be exactly 
implemented as an LP over Q{H). Clearly, Q{H) has no pseudocodewords iff C = V{H), or 
equivalendy, P{C) = Q{H). But since P{C) C Q(C^) C Q{H), this obviously implies that we 
must have P{C) = Q{C^). Conversely, if P(C) = Q{C^), then we may simply take H = to 
obtain a fundamental polytope Q{H) without pseudocodewords. We record this observation as a 
lemma. 

Lemma 6.1. Let C be a binary linear code. There exists a spanning subset, H, ofC^ such that the 
polytope Q{H) has no pseudocodewords ijf P{C) = Q{C-^). 

A code C for which P{C) = Q{C^) holds will be called geometrically perfect, and we will denote 
by © the family of all such codes. So the question then is: which codes are geometrically perfect? 
An answer to this was provided by Barahona and Grotschel [2J, who showed that the relationship 
P{C) = Q{C^) is equivalent to Seymour's "sums-of-circuits" property for binary matroids [43]. 
The following theorem is thus equivalent to Seymour's characterization of binary matroids with the 
sums-of-circuits property. 

Theorem 6.2 ([2], Theorem 3.5). A binary linear code C is geometrically perfect iff C does not 
contain as a minor any code equivalent to Ti^, Riq or C{K^)^. 

By the above theorem, © is minor-closed. Moreover, since none of Ti^, Riq or C{K^)^ is 
graphic, no graphic code can contain any of them as a minor, and so graphic codes are geometrically 
perfect. 

Grotschel and Truemper [25, Section 4] showed that © is closed under the operations of direct 
sum, 2-sum and 3-sum, but is not closed under 3-sum. They also observed [25, p. 326] that any 
code in © can be constructed via direct sums, 2-sums and 3-sums from graphic codes and copies 
of the codes Ti-j, C{K^^^)^ and C{V%)-^, where Vg is the graph in Figure jH Indeed, this result is 
implied by Theorems 6.4, 6.9 and 6.10 in |[43l . It is not hard to verify that the codes Wy, C(K3 3)-*- 
and C{V^)-^ are in fact in ©. Putting these facts together, we obtain the following theorem. 

Theorem 6.3. For a binary linear code C, the following are equivalent statements. 

(i) C is geometrically perfect, i.e., P{C) = Q(C^). 

(ii) Each leaf in some complete, 3-homogeneous code-decomposition tree for C is either graphic, 
or equivalent to one of the codes Ti-j, C{K^^'i)^ and C(V8)^- 

(iii) Each leaf in every complete, 3-homogeneous code-decomposition tree for C is either graphic, 
or equivalent to one of the codes Ti-j, C{K^^?j)^ and CiV^)^. 
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Figure 4. The graph Vs. 



Proof, (i) impUes (iii) follows directly from the fact that any code in 6 can be constructed via 
direct sums, 2-sums and 3-sums from graphic codes and copies (up to equivalence) of the codes 
Hj, C(Er3 3)^ and C{Vs)'^. The implication (iii) => (ii) is trivial. Finally, (ii) =^ (i) holds since 
graphic codes and the codes H7, C(i^3^3)^ and C{Vs)^ are all in (S, and (S is closed under direct 
sum, 2-sum and 3-sum. □ 

Since a complete, 3-homogeneous code-decomposition tree for a code can be constructed in 
polynomial time, and testing for graphicness or equivalence to Hr, C(K3,3)-'- and C{Vs)'^ can also 
be carried out in polynomial time, we have the following corollary to the above theorem. 

Corollary 6.4. It can be decided in polynomial time whether or not a given code C is geometrically 
perfect, i.e., has the property P{C) = Q{C^). 

However, this is only half the story. If C is a geometrically perfect code, the algorithm guaranteed 
by the above result will determine this to be the case, but will not produce a "small" subset H C C-^ 
such that P{C) = Q{H). The only information we would have is that H can be taken to be the 
entire dual code C^. While it would then be true that the LP minxgQ(c±) (7, x) is equivalent to ML 
decoding, no known LP-solving algorithm could be guaranteed to efficiently solve this LP. This 
is because the representation of Q{C^) given by ([TOl)-(fTTI) has size exponential in the codelength 
n. Fortunately, as we shall describe next, for the family, 0, of geometrically perfect codes, ML 
decoding can always be implemented in time polynomial in codelength, not using an LP-solving 
algorithm, but by means of a combinatorial optimization algorithm that uses code decompositions. 

In a series of papers B31 - H71 (see also |48 1), Truemper carried out a careful examination of ma- 
troid decompositions, from which a particularly interesting observation concerning geometrically 
perfect codes could be inferred. Let 0o be the sub-family of (S that consists of all graphic codes and 
codes equivalent to one of Tij, C{K^^^)-^ and C{Vs)'^. As observed in the proof of Corollary 6.6 in 
fl5\ . Truemper 's analysis of matroid decompositions could be used to show that a 2-connected code 
C G — ©0 is equivalent to either Ci ©2 C2 or Ci ©3 C2, for some Ci S ©0 and some 2-connected 
code C2 G (5, both of which are equivalent to minors of C. The decomposition of C into Ci and 
C2, along with the coordinate permutation that takes Ci ©2 C2 or Ci ©3 C2 (as the case may be) to 
C, can be determined in polynomial time. These facts have some significant consequences, one of 
which is that any code in © is ML-decodable in polynomial time. However, rather than state these 
results just for the class of geometrically perfect codes, we will state and prove them more generally 
for codes that are "almost graphic" in the sense that they can be composed from graphic codes and 
finitely many other codes. 

Recall that F denotes the family of graphic codes. 

Definition 6.1. A minor-closed family of codes <t is defined to be almost-graphic if there exists a 
finite sub-family D d <t such that any 2-connected code C £ (tis either in T UD, or is of the form 
T^iCi ©2 C2) or 7r(Ci ©3 C2), for some permutation vr of the coordinate set ofC, and some codes Ci 
and C2 such that 



A CODE DECOMPOSITION THEORY 



27 



(a) Ci and C2 are equivalent to minors ofC, and 

(b) Ci € r U 2) and C2 is 2-connected. 

If there exists a constant I > such that for any length-n code C G (L — {TUD), the components 
Ci, C2 and TT of the above decomposition can be determined in time 0{n^), then the family of codes 
^ is said to be polynomially almost-graphic (PAG). 

Note that the 3-sum is conspicuous by its absence from the above definition. We will give an 
explanation of this at the end of this section. 

Definition 16.11 clearly implies that any code in an almost-graphic family <t has a 3-homogeneous 
code-decomposition tree. The definition in fact implies that a 2-connected code in C has a decom- 
position tree with the property that each leaf is in T U 5), and for each non-leaf node V in the tree, 
Ichild.COde is a leaf (and hence, is in F U 5)), where Ichild is the left child of V. Such a code- 
decomposition tree will be called (F U D)-unary. If C is PAG, a 3-homogeneous, (F U D) -unary 
decomposition tree can be constructed for any 2-connected code C € £ in time polynomial in the 
length of C. 

The family, F, of graphic codes is trivially PAG. From the discussion prior to the above definition, 
the family, C5, of geometrically perfect codes is also PAG. Other examples of PAG families are the 
code families (T^ (cf. Section[5l) for JT = {n7,C{K5)^} and J" = {Hjr ,C{K5)^}, and the family 
of co-graphic codes without a C{K5)-^ minor |[25l Corollary 6.6]. PAG codes inherit some of the 
properties of graphic codes. For example, it is known OTl . llSTI that the ML decoding problem over 
a memoryless binary symmetric channel can be solved in polynomial time for the family of graphic 
codes using Edmonds' matching algorithm |[T3l . |[T4l . A much stronger decoding result can in fact 
be proved for graphic codes, and more generally for PAG codes. This is based on the following 
optimization result proved in 1251 . an argument for which is sketched in Appendix iBl 

Theorem 6.5 ([25], Theorem 6.5). Let (t be a PAG family of codes. There exists a constant I > 
such that given any length-n code C in (t and any 7 € M", a codeword c^in € C achieving 
mincgc(7)C;) (or equivalently, minxgp(c) (7, x)) can be determined in 0{v}) time. 

It should be noted that an actual implementation of the polynomial-time algorithm implicit in 
Theorem 16.51 (and outlined in Appendix |B]| requires arithmetic over the real numbers, unless the 
vector 7 has only rational coordinates. So, in practice, finite-precision arithmetic used in any com- 
puter implementation of the algorithm could only approximate the linear cost function (7, c) for an 
arbitrary 7 € M". 

As mentioned earlier, ML decoding over a discrete memoryless channel can be formulated as 
a linear program [15]. Therefore, we have the following corollary to the above theorem, again 
with the caveat that a true implementation of a polynomial-time algorithm for ML decoding would 
require real-number arithmetic. 

Corollary 6.6. The maximum-likelihood decoding problem over a discrete memoryless channel can 
be solved in polynomial time for a PAG family of codes. In particular, geometrically perfect codes 
are ML-decodable in polynomial time. 

Another problem that is known to be NP-hard in general is the problem of determining the 
minimum distance of a code Il53l . For graphic codes, this is equivalent to the problem of finding 
the girth of a graph, which can be solved in time polynomial in the number of edges of the graph — 
one of the earliest such algorithms published [ 30] runs in 0(n'^/^) time in the worst case, where n 
is the number of edges. As a consequence of Theorem 16. 51 we also have that the minimum distance 
problem for a PAG family of codes can be solved in polynomial time. 

Corollary 6.7. The minimum distance of any code in a PAG family can be determined in polynomial 
time. 

Proof. Let C be a code of length n containing at least one nonzero codeword. For i = 1, 2, . . . , n, 
define 7*^*^ = (1, . . . , 1, — n, 1, . . . , 1), with the —n appearing in the ith coordinate of 7*^*^. Note 
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that miiicgc (t*^*^ , c) is always achieved by a codeword in C with zth coordinate equal to 1, if such a 
codeword exists. Indeed, the minimum-achieving codeword c^*) is the codeword of least Hamming 
weight among codewords in C that have a 1 in the ith coordinate; if there is no codeword in C with 
a 1 in the ith coordinate, then c^*) = 0. Note that if c^*) 7^ 0, then (7^*^ c^*)) = w{c^''^) — 1 — n. 
Therefore, the minimum distance of C is given by 

cZ = n + 1 + min min(7^*^ , c) . 

je[n] cGC 

For a PAG family of codes (t, given any code C G each of the minimization problems 
miricgc (7^*^ 1 c) can be solved in polynomial time, and hence the minimum distance of C can be 
determined in polynomial time. □ 

We point out that in all the minimization problems that must be solved (recursively going down 
the code-decomposition tree, as explained in Appendix E]) to determine the minimum distance of a 
length-n code, the cost vectors have integer coefficients of magnitude at most n?. So, the minimum 
distance of a code from a PAG family can be determined in polynomial time using finite-precision 
arithmetic. 

The downside of PAG (and more generally, almost-graphic) code families is that they are not 
very good from a coding-theoretic perspective. Recall from coding theory that a code family is 
called asymptotically good if there exists a sequence of [ui, ki, di] codes Q G (L, with linij rij = 00, 
such that liminfj ki/rii and liminfi di/rii are both strictly positive. 

Theorem 6.8. An almost-graphic famUy of codes cannot be asymptotically good. 

In particular, the family of geometrically perfect codes is not asymptotically good. The theorem 
is proved in Appendix ICl In view of Lemma [6m the above result has the following very interesting 
corollary. 

Corollary 6.9. Let <t be a family of binary linear codes with the following property: for each C € <t, 
there exists a parity-check matrix HforC, such that the corresponding fundamental polytope Q{H) 
has no non-integral vertices (pseudocodewords). Then, (t is not an asymptotically good code family. 

Loosely speaking, this means that linear-programming decoding, when applied to a "good" code, 
must suffer on occasion from decoding failure due to the presence of pseudocodewords, even if all 
possible parity checks (dual codewords) are used in the constraints ([TOl)- (fTTI) of the LP. Given the 
close relationship between linear-programming decoding and iterative decoding using the min-sum 
algorithm [54], a similar result is likely to hold for iterative decoding as well. 

We end this section with an explanation of why 3-sums were left out of Definition |6?T] The proof 
of Theorem 16.5 1 given in Appendix IB] relies crucially on the fact that a 3-homogeneous, (F U D)- 
unary code-decomposition tree can be constructed in polynomial time for a code from a PAG family. 
So, the result is actually true for any code family for which such trees can be constructed in polyno- 
mial time. Now, if 3-sums were allowed in Definition |6.1[ it is no longer obvious that codes from the 
resulting code family would still have 3-homogeneous, (F U 2))-unary code-decomposition trees. 
There is good reason to think that this could still be true, especially in light of Corollary 14. 101 which 
states that a code has a 3-sum decomposition only if it has a 3-sum decomposition. Indeed, that 
result may lead us to believe that if a code C has a (F U 2)) -unary code-decomposition tree which 
contains 3-sums, then replacing the 3-sums in the tree with 3-sums in the manner prescribed by 
Corollary 14. 101 should result in a 3-homogeneous, (F U 5})-unary code-decomposition tree. How- 
ever, to show that this is the case, it would have to be verified that if C = Ci ©3 C2, with Ci, C2 
satisfying (a) and (b) of Definition l6.1[ then (in the notation of Corollary 14. 101) Ci and C2 also satisfy 
(a) and (b). Now, it is not hard to check that if Ci is graphic, then so is Ci. However, unless C is 
3-connected, there is no guarantee that Ci and C2 are equivalent to minors of C, even though Ci and 
C2 are given to be equivalent to minors of C. 
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It is in fact quite likely to be true that if C = Ci ©3 C2, with Ci and C2 equivalent to minors of 
C, then Ci and C2 are also equivalent to minors of C. So, it is quite possible that if Definition 16. II 
were to include 3-sums as well, then the resulting code families would still have 3-homogeneous, 
(ruS))-unary code-decomposition trees, and hence Theorem 1 6 . 5 1 would continue to hold. However, 
a rigorous proof of this would take us far outside the main theme of our paper, and Definition 16.11 
as it stands is good enough for our purposes. 

7. Concluding Remarks 

A natural question to ask upon studying the decomposition theory presented in this paper is 
whether one can define /c-sums for A; > 4 that have the same attractive properties as 2-, 3- and 
3-sums. Ideally, such a fc-sum, denoted by (Bk, would have the following property for some fixed 
integer I > k: 

a fc-connected code C has a /c-separation {J, J'^) with min{| J|, | J^|} > / iff C = 
vr(Ci (Bk C2) for some permutation tt of the coordinates of C, and codes Ci and C2 
equivalent to minors of C. 

It is indeed possible to define A;-sums in such a way that we have C = 7r(Ci 0^ C2) iff C has a 
^-separation (J, J"^) with min{| J[, \ J'^\} > /. The tricky part is ensuring that the component codes 
Ci and C2 are retained as minors of C, and this appears to be difficult in general. However, we have 
some preliminary results that indicate that even without the last property, such A;-sums can be used 
as the building blocks of a decomposition theory that ties in beautifully with Forney's theory of 
cycle-free realizations of linear codes ||T9l . This theory would make further deep connections with 
matroid theory, particularly with the notions of matroid branchwidth and treewidth ll22l . jlTl . An 
exposition of this theory will be given in a future paper. 

While the decomposition theory in this paper has been presented mainly in the context of binary 
linear codes, it is possible to extend some of it to linear codes over arbitrary finite fields as well. The 
definitions of minors and /^-connectedness can be obviously extended to nonbinary codes. Also, a 
2-sum operation can be defined for codes over an arbitrary finite field F, and the entire theory 
outlined in Section RTT] does carry over. However, there is no known 3-sum operation defined for 
nonbinary codes that has a property analogous to that stated in Theorem 14. Ill Again, it seems that 
if we are prepared to give up the requirement that the components of a A;-sum be retained as minors 
of the composite code, then it is possible to develop a powerful decomposition theory for nonbinary 
codes just like that for binary codes. 

We end with a pointer to a very interesting direction of current research in matroid theory. This 
involves the resolution of two conjectures whose statements (in the context of codes) we give below. 
Recall from Section [5] that the notation (L^, for some fixed collection JF of codes, refers to the set 
of all binary linear codes C such that no minor of C is equivalent to any C G We extend that 
notation to codes over an arbitrary finite field F as well. 

Conjecture 7.1 ( li23l . Conjecture 1.2). If (t is a minor-closed class of codes over a finite field F, 
then <t = for some finite collection of codes T. 

Informally, the above conjecture states that any minor-closed class of codes is characterized by 
a finite list of excluded minors. 

Conjecture 7.2 (|l23l. Conjecture 1.3). Let hA he a fixed code. Given a length-n code C, it is 
decidahle in time polynomial in n whether or not C contains M. as a minor. 

The two conjectures together imply that the membership of a code in a minor-closed class can 
always be decided in polynomial time. To put it another way, if a property of codes is preserved 
under the action of taking minors, then it should be decidable in polynomial time whether or not a 
given code has that property. It should be pointed out that both conjectures have been shown to be 
true in the context of graphic codes, as part of the celebrated Graph Minor Project of Robertson and 



30 



N. KASHYAP 



Seymour |[39]l . B0l . The Graph Minor Project has had a profound impact on modern graph theory 
ll35l . and its extension to F-representable matroids (equivalently, codes over F) is bound to have a 
similar influence on matroid theory and, as a consequence, on coding theory. 

Appendix A. Proofs of Propositions I4.2I and |4.9I 

Proof of Proposition \4. 2\ Let C and C be [n, k] and [n', k'] codes, respectively, for which C ©2 C 
can be defined. We want to show that (C ©2 C')^ = ©2 C'^. 
First, observe that, by Proposition 14. 1 f a). 

dim((C ©2 C')^) = {n + n -2)- dim(C ©2 C) = {n + n' - 2) - {k + k' - I) 
= {n-k) + {n' -k')-l = dim(C^ ©2C'^). 

Therefore, it is enough to show that ©2 C'^ C (C ©2 C')^ . 

Given a binary word x = X1X2 . . . x„ and a positive integer m < n, let denote the length-m 
prefix of x, and let x^: denote the length-m suffix of x. Also, we denote the concatenation of two 
binary words x and x' by x || x'. 

Consider an arbitrary codeword, x, of C"*- ©2 C'^. Such a word is of the form X[.„_i] || x|,^,^.j 

for some x = (xi, . . . , € and x' = {x'l, . . . , x'^,) G C'"*" such that Xn = x'^. We will show 
that X[.,„_i] II as above must also be in (C ©2 C')"*". 

Let c be an arbitrary codeword of (C ©2 C). Such a codeword is of the form C[.„_i] || c'^,_j^.j 
for some c = (ci, . . . , c„) £ C and c' = (c'^^, . . . , c^,) G C such that c„ = d^- The dot product c • x 
evaluates to ^Y^=\ '^i^i + Y17=2 ^i^'v addition operations being performed modulo 2. But since 
c G C and x G C^, we have Yl^=i ^i^i — 0' from which we obtain Yl^=i ^i^i — (^nXn- Similarly, 
X]"=2 ^'i^'i = Hence, c • x = c„x„ + c'^x'^ = 0, since c„ = c'j^ and x„ = x'^^. As c is an 

arbitrary codeword of (C ©2 C), we have shown that x G (C ©2 C')"*". 

Therefore, ©2 C'"^ C (C ©2 C')"^, which completes the proof. □ 

The proof of Proposition |4i9] closely resembles that of Proposition 14.21 so we continue to use the 
notation introduced in the latter proof. 

Proof of Proposition \4. 9\ Let C and C be [n, k] and [n' , k'] codes, respectively, for which C ©3 C 
can be defined. Observe that, by Proposition 14.7 1 and Lemma l48l we have 

dim((C ©3 C')^) = {n + n -Q)- dim(C ©3 C) = {n + n - Q) - {k + k' - A) 
= {n-k) + {n' -k')-2 = dim((:^ ©3 C'^). 

Therefore, to prove Proposition |4.9[ it is enough to show that C"*" ©3 C'^ C (C ©3 C')"*". 

It is easily seen that an arbitrary codeword, x, of ©3 C'"*" must be of the form X[.„_3] || xj^, g.j 

for some x = {xi, . . . , x„) G and x' = {x[, . . . , x'^,) G C"*" such that {xn-2,Xn-i,Xn) = 
(x'^, X2, X3). We will show that any such x is also in (C ©3 C')^. 

Let c be an arbitrary codeword of (C ©3 C), so that it is of the form C[.„_3] || cj^/_3.] for some 
c = (ci, . . . , c„) G C and c' = (c'l, . . . , c^,) G C such that (c„_2, c„_i, c„) = (c^, 4, c'3). The dot 
product c • X evaluates to Y17=i '^i^i + Sr=4 '^i^i' addition operations being performed modulo 
2. But since c G C and x G C"*", we have X^ILi ~ 0, from which we obtain Y17=i ^i^i ~ 
YTi=n-2 (^iXi- Similarly, YJLa c'ix[ = Y.l=i (^A- Hence, c • X = E[=n-2 ^i^^i + ELi c-a; which 
equals since c„+j_3 = and x„+j_3 = x\ for i = 1,2,3. As c is an arbitrary codeword of 
(C ©3 C), we have shown that x G (C ©3 C')^ . 

Therefore, ©3 C'^ C (C ©3 C')"^- which completes the proof. □ 
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Appendix B. Sketch of Proof of Theorem 16.51 

We will only provide a sketch of the proof of Theorem |63J as it is a result extant in the literature 
ll25l Theorem 6.5]. The purpose of sketching out the proof is that it outlines the polynomial-time 
algorithm that determines 

min(7,c), (12) 

cGC 

for a length-n code C from a PAG family, and a cost vector 7 G M". 

The proof of the theorem is by induction using the following lemma and the fact (explained 
further below) that the minimization of a linear cost function can be done in polynomial time for 
graphic codes. 

Lemma B.l. (a) Let C = Ci (B2 C2- Then the minimum in f li2D can be obtained by solving two 
minimization problems of the form miiicgCi ('^' ^) '^^^ "'^^ problem of the form miiicgCa (/^j *^)- 
(b) Let C = Ci ©3 62- Then the minimum in f li2D can be obtained by solving four minimization 
problems of the form miiicgCi ("^j ^) '^^^ ""^ problem of the form mincgC2 (/^i 

Proof. Let n, ni and 722 denote the lengths of C, Ci and C2, respectively. Given 7 G M", let 

M = i + E-=i h\- 

(a) Let C = Ci ©2 C2. Define a^o) = (af ^ , . . . , ) and a(^) = {a'i^ a^V ) as follows: 

(0) _ f 7i i = l,...,ni-l 
\ M i = ni 

(1) _ f 7i i = l,...,ni-l 
I -M i = Til 

For j = 0,1, determine fij = miricgCi (o*^"''') c), and a minimum-achieving codeword c^-^^ = 
{c[^\..., c^nl) G Ci. By choice of M, we have that c^n} = 0, while c^n} = 1. 
Now, define (3 = . . . , j3n2) as follows: 

^1 - /^o + ^ = 1 

7ni+i-2 «=2,...,n2. 

Solve for /i = minceC2 (/^) c), and find a codeword c = (ci , . . . , c„2 ) S C2 achieving this minimum. 
We claim that mincGc(7) c) = //q + and that a minimum-achieving codeword in C is 

C ■ = / ^'^1°^ ' ■ ■ ■ ' • • • , Cna) if Ci = 

1 (4^\--->47-l'C2,---,C„2) ifci = l. 

We will first show that for each c G C, we have (7, c) > /Uq + /J. Pick an arbitrary c G C. There 
exists a unique pair of codewords x = (xi, . . . , x„J G Ci, y = (yi, . . . , y^j) G C2 such that 
= yi, and (xi, . . . , x^-i, 2/2, . . . , yna) = c. Suppose that x^^ = yi = 0. We then have 

(7,c) = (a(0),x) + (/?,y) > ^0 + /J- 
Next, suppose that = yi = I. In this case, we have 

(7,c) = (a«,x) -«« + (/?, y)-/?i 

= (a«,x)-(-M) + (/?,y)-(/ii-/io + M) 

> Hi - i-M) + Ji - {fii - fiQ + M) = fio + 'P- 

Thus, (7, c) > fiQ + Jl, as desired. 

It is now enough to show that (7, c^m) = ^0 + /2- By definition of Cmin, 

r (a(0),cW) + (/3,c) ifci = 

In either case, (7, Cmin) = no + 
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(b) Let C = Ci 03 C2- Note that from (A1')-(A3') in Definition |431 it follows that the restriction 
of Ci (resp. C2) onto its last (resp. first) three coordinates is {000, Oil, 101, 110}. 



Define a(^) = {Jf, 



,ani), J 



a. 



-a 



-a, 



0, 1, 2, 3, as follows: 

JO) 



7j for z = 1, . . . , ni — 3, and 

M 
M 
M 

For j = 0, 1, 2, 3, determine Hj = minceCi {a^-'K c), and a minimum-achieving codeword c^-'^ 



-a. 



a. 



(2) 
(3) 

a 



(4^ 



, ) € Ci- By choice of the a^^^'s, we have that 



(i) Ji) 



2' "'ni-l' '^ni ) 



000 


ifi 


= 


oil 


ifi 


= 1 


101 


ifi 


= 2 


110 


ifi 


= 3. 



Now, take /3 = , . . . , /^^a ) to be 

-(/xo + /ii - /U2 - /U3)/2 + M i 
-(/«o - W + /^2 - M3)/2 + M i 
-(iUo - /ii - /i2 + M3)/2 + M i 

Solve for /I = mincgC2 (/^) c)' ^^^^ fii^^ ^ codeword c = (ci , . . . , c„2 ) G C2 achieving this minimum. 
It may be verified that miiicgc (Tj c) = /Uq + A^. and that a minimum-achieving codeword in C is 



1 
2 
3 

4,..., 712. 



.(0) 



' ''ni-3' '^4) • • • ) Cnj; 

(1) . . ^ 

' ''ni-3' C4, • • • , Crij; 

(21 , . ^ 

' '^ni-3' '^4) • • • ) Cnj; 

(31 , . ^ 

' ''ni-3' C4, • • • , C„2; 



if (ci,C2,C3) = 000 
if (ci,C2,C3) = oil 

if (ci,C2,C3) = 101 
if (ci,C2,C3) = 110. 



□ 



The details of the verification are along the lines of that in part (a), and are left to the reader. 

Suppose that we have a PAG code family and must solve ([T2l ) for a given length-n code C € C 
and cost vector 7 = (71 , . . . , 7^) G M". Note that if C can be expressed as a direct sum, Ci C2, of 
codes C\ and C2 of lengths ni and n^, respectively, then 



min(7,c) = min (7'- c^"^^) + min (7*-^-', 
cec c(i)gCi c(2)ec 



,(2) 



where 7^^^ = (71 , . . . , 7„^ ) and 7^^) = (7ni+i , • • • , 7n)- Therefore, to show that (fT2l) can be solved 
in time polynomial in n, it is enough to show that there is a polynomial-time algorithm in the case 
when C is 2-connected. 

It follows from Definition 16.11 that there exists a finite sub-family 2) C C such that each 2- 
connected code C G C has a 3-homogeneous, (T U 2)) -unary code-decomposition tree, which can 
be constructed in polynomial time. So, an algorithm for solving ([T2l ) for a 2-connected code C € ^ 
would use Lemma IbTT] to recursively go down the code-decomposition tree starting from the root 
node, solving at most four minimization problems at each leaf of the tree. Recall that each leaf of a 
(ruS))-unary code-decomposition tree is a code in FUS). Since such a tree for a length-n code can 
have at most n leaves, the algorithm for solving (fT2l) would run in time polynomial in n, provided 
that there is a polynomial-time algorithm for solving (fT2l) for codes C € F, i.e., graphic codes. 

Indeed, there exists a polynomial-time algorithm for solving (fT2l) for graphic codes. Note that 
the minimization problem min(7,c) = min^"^]^ 7jCj over a graphic code C(^) is equivalent to 
the problem of finding the minimum- weight Eulerian subgraph in the graph Q whose edges Cj, 
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\V\ > 



i = 1,2, ... ,n, we given the weights 7j. An Eulerian subgraph of a graph is a subgraph in which 
each vertex has even degree. 

The Eulerian subgraph problem can be solved as follow^ Given a subset T of the vertices of 
Q, a T-join is a set of edges J of Q such that a vertex vinQ has odd degree with respect to (wrt) J 
if and only if ?; G T. Let N be the set of edges {ej : 7^ < 0}, and let T be the subset of vertices 
with odd degree wrt N . Define the graph Q' to be the graph Q but with edge-weights 7^' = |7j|, 
i = 1,2, ... ,n. Find a minimum-weight T-join, J, in Q' . The symmetric difference JIS.N is 
the required minimum-weight Eulerian subgraph of Q. A minimum-weight T-join can be found 
in polynomial time BTl Chapter 29], and so, a minimum-weight Eulerian subgraph of Q can be 
determined in polynomial time. 

Appendix C. Proof of Theorem 16. 81 

We will first prove that the family, F, of graphic codes is not asymptotically good. This is 
probably a "folk" theorem, but we could not find an explicit proof in the literature. Our proof relies 
on the fact |T| that for a graph Q = {V, E) with girth g and average degree 5 = 2\E\/\V\ > 2, the 
number of vertices satisfies the so-called Moore bound: 

1 + 6 E\=o^^\'^ - 1)' if 5 is odd 
2 - 1)' if 5 is even. 

For our purposes, the weaker bound \V\ > {6 — l)!^?/^]-! is enough, as we then have, for 6 > 2, 

loe-\V\ lofflT'l 
g<4+ ^^<4+ (13) 

where, for the sake of concreteness, log denotes the natural logarithm. 

Since codewords in C{G) correspond to cycles in G, we see that the minimum distance of C{G) 
equals the girth g of the graph G- Furthermore, if G is connected, then (as mentioned in Section O 
the rank of its vertex-edge incidence matrix is \V\ — 1, and hence, dim(C(t/)) = \E\ — {\V\ — 1). 
Thus, the rate of C{G) is 1 — {\V\ — 1)/\E\ ^ 1 — 2/6. Consequently, if a family of graphic 
codes has dimension growing linearly with codelength n, then by ( fT3] ) their minimum distance 
grows as O(logn), which implies that graphic codes are not asymptotically good. This argument is 
formalized in the proof given below. 

Lemma C.l. The family of graphic codes is not asymptotically good. 

Proof. We will in fact prove a stronger statement: for r € (0, 1), let F^ = {C G F : C has rate > r}; 
then, for any code C £ F^. with length n > 2, we have 

d{C) < r4^^. (14) 

^ ^ - log(l + r/2) 

Consider first an [n, k, d] code C € F^ with n > 2/r. Without loss of generality (WLOG), we can 
assume that C = C(^) for some connected graph G = {V,E) [38, Proposition 1.2.8]. Therefore, 
k/n = 1 — — 1)/|£'| > r, or equivalently, — < 1 — r. Furthermore, since n > 2/r, 

we have that |y |/|T'| < 1 — r/2. Therefore, the average degree, 6, of G is larger than 2/(1 — r/2), 
which is in turn larger than 2 + r/2. Hence, by (fT3] ). 

d < 4 + ; — —2 — < ^ 



log(l + r/2) - log(l + r/2) ' 

The last inequahty above holds when i^J^^ /2) — 4/3, which is true forn > 2/r, since iog{i+r/2) — 
4/3 for r G (0, 1), as a simple calculation will confirm. 



^This approach to solving the Eulerian subgraph problem was conveyed to the author by Adrian Vetta. 



34 



N. KASHYAP 



Therefore, without the assumption n > 2/r,we, have 

41ogn 



d{e) < max<^ 2/r, 



log(l + r/2) 

However, it is straightforward to verify that (2/r) log(l + r/2) < 1 for any r > 0, from which we 
obtain that for n > 2, ^^^^"l'^/^) > '^Ir, and ^ follows. □ 

Given an almost-graphic code family <t, let 2) be the finite sub-family of codes with the properties 
guaranteed by Definition 16. II WLOG, we may assume that 2) is minor-closed. 
For r e (0, 1), define Nj. to be the least positive integer such that for all n > N^, 

1 2 

< : ; ^ , . < 



log(l + (r - 2/n)/2) log(l + r/2)' 

Note that since lim„^oo 1/ log(l + (r — 2/n) /2) = 1/ log(l + r/2), such an Nr does exist. Now, 
define 

dmeoi{r, S) = max{(i(C) : C G C and has length at most Nr,orC G S)}. 
We now have the definitions needed to state the next result, which shows that codes in C can- 
not have both dimension and minimum distance growing linearly with codelength. It is clear that 
Theorem [63] follows directly from this result. 

Lemma C.2. Let C be an almost-graphic family of codes. For any r G (0, 1), ifC G C is an [n, k, d\ 
code with k/n> r, then 

d < max|(imax('^,S), - — 7rT~~77^r • (1^) 
I, log(l + r/2) J 

Proof. From the definition of dmaxC'^jS) and (fT4l) . it is obvious that the statement of the lemma 
holds for all codes in F U 2). The proof that the statement holds for all codes in C is by induction 
on codelength for a fixed r G (0, 1). 

So, fix an r G (0, 1). If no is the smallest length of a non-trivial code in £, then a length-no code 
in cannot be decomposed into smaller codes, and so must be in F U 2). Therefore, the statement 
of the lemma holds for the base case of length-no codes. 

Now, suppose that for some n > no, (fTSl ) holds for all codes C G £ of length n' < n — 1 and 
rate larger than r. Let C G be an [n,k,d] code with k/n > r. If C = Ci © C2 for some codes 
Ci and C2 in (J, then at least one of Ci and C2 has rate lai^ger than r, and so ([T5l) holds for C by the 
induction hypothesis. 

We may thus assume that C is 2-connected. IfC G FU2), there is nothing to be proved. So, either 
C = 7r(Ci ©2 C2) or 7r(Ci ©3 C2) for Ci, C2, vr as in Definition [611 WLOG, we may take tt to be the 
identity permutation, so that C is either Ci ©2 C2 or Ci ©3 C2, for some [ni, ki] code Ci G F U 2) 
and some [n2, k2] code C2 G 't. 

We consider the case C = Ci ©2 C2 first. By Proposition 14.11 k = ki + k2 — 1, and d < 
mm{d{C[), d(C0}, where C[ = Ci \{ni} and C'2 = C2 \{1}. Note that for i = 1, 2, C- is an [n^, k'^] 
code, where n- = n^ — 1 and fc- = fej — 1. Thus, k = k'^ + k'2 + 1 and n = n'^ + n'g. 

Now, if /c^/n^ > r for some i G {1, 2}, then the statement of the lemma holds for C by the 
induction hypothesis. So, we are left with the situation when k[/n^ < r for z = 1, 2. But in this 
case, since k/n = {k'l + feg + \)/{n'i + n'2) > r, we must have k'i/n[ > r — l/n[; otherwise, we 
would have k'2 > {n[ + n2)r — I — k[ > {n[ + n2)r — 1 — (rn'^ — 1) = rn2, which would mean 
that fca/n'g > r. Note that since Ci G F U 2), and C[ is a minor of Ci, we have that C[ G F U 2). If 
C( G 2) or n'^ < Nr, then ci(C() < (ijnax(''5 2)); otherwise, C{ is graphic with n[ > Nr, and so, by 
([141) and the definition of A^^^., 

^ , ^ 4 log n[ ^ 4 log n; ^ Slog n[ 

^ - log(l + (r - l/n;)/2) - log(l + (r - 2/n;)/2) " 1 + r/2' 
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In any case, d{e[) < max |dmax(?', 2)), iog(i+"/2) } < max ^d^^^{r,D), iog(i+r/2) }- ^i"*^^ 
d < d{C[), we have that O holds for C. 

Finally, we deal with the case when C = Ci ©3 C2. The approach is essentially the same as that in 
the 2-sum case. This time, defining C'l = Ci \{ni — 2, ni — 1, ni} and €'2 = C2 \{1, 2, 3}, we have 
d < m.m{d{C[),d{C2)}. For i = 1,2, we now find that C- is an [n-, k'-] code, where re- = — 3 
and k[ = ki — 2. Thus, n = n'^^ + re2, and via Lemma l4T8l k = k[ + k2 + 2. If either k[/n[ or k2/n'2 
is larger than r, then ([13]) holds for C by the induction hypothesis. So suppose that k[/n[ < r for 
i = 1,2. Since k/n = {k[ + /cg + l)/{n[ + re2) > r, we must have k[/n[ > r — 2/n[; otherwise, 
we would obtain k'2/n'2 > r. If CJ^ € 2) or n'l < Nr, then d{C[) < dmax(^;2)); otherwise, C'l is 
graphic with n[ > Nr, and so, by (fT4l) and the definition of A'^r. 

, 41ogre; 8 log 

- log(l + (r - 2/re;)/2) " 1 + r/2 ' 

In any case, d{C'i) < max |(imax(?'i 2)), iog^(i°f J/2) }' ^'^'^ since d < (i(Cj), we see that (1151 ) holds 
for C. The proof of the lemma is now complete. □ 
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